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BACKGROUND OF THE INVENTION 



This application claims priority to U.S. Provisional Patent Application No. 
60/437,928 filed on January 3, 2003, U.S. Provisional Patent Application No. 60/446,942 
5 filed on February 12, 2003, and U.S. Provisional Patent Application No. 60/474,826 filed 
on May 30, 2003, all of which are incorporated by reference in their entireties. 

This invention was made with government support under grant number GM61393 
awarded by the National Institutes of Health. The government has certain rights in the 
invention. 

10 1. Field of the Invention 

The present invention relates generally to the fields of molecular genetics, 
pharmacogenetics, and cancer therapy. In particular, the present invention is directed to 
methods and compositions for detecting polymorphisms and correlating the presence or 
absence of certain polymporphisms with toxic effects of chemotherapies. More 

15 specifically, the present invention is directed to methods and compositions for 
determining the presence or absence of polymorphisms within a uridine diphosphate 
glucuronosyltransferase I Al (UGT1A1) promoter and correlating these polymorphisms 
with toxic effects of irinotecan, as well as evaluating the risk of an individual for 
developing irinotecan toxicity. In some embodiments, the invention concerns methods 

20 and compositions for predicting or anticipating the level of toxicity caused by irinotecan 
and other compounds glucuronidated by a UGT enzyme in a patient. Such methods and 
compositions can be used to evaluate whether irinotecan-based therapy or therapy 
involving a UGT substrate may pose toxicity problems if given to a particular patient. 
Alterations in suggested therapy may ensue if a toxicity risk is assessed. 

25 2. Description of Related Art 

Glucuronidation plays a major role in the pharmacological activity and clearance 
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of a large variety of compounds (Tukey and Strassburg, 2000). Genetic studies of UDP- 
glucuronosyltransferases (UGTs) aim to characterize an individual's predisposition to 
various diseases and increased risk of adverse outcome to drug treatment. The variation 
in the UDP-glucuronosyltransferase 1A1 (UGT1A1) gene is the most extensively studied. 
5 UGT1A1 basal expression is affected by the variable number of TA repeats in the TATA 
box, i.e., (TA) n , see U.S. Patent 6,395,481, which is incorporated herein by reference. A 
variable number of repeats (5, 6, 7, and 8) have been found in the UGT1A1 TATA box. 
Gene transcriptional efficiency has been inversely correlated to the number of TA repeats 
(Beutler et al, 1998). Thus, a larger TA repeat number is associated with reduced 
10 transcriptional activity (Beutler et al, 1998) leading to various degrees of impaired 
glucuronidation of UGT1 Al substrates. 

Homozygosity for (TA) 7 allele is associated with Gilbert's syndrome (a familial 
mild hyperbilirubinemia) (Bosma et al, 1995 and Monaghan et al, 1996) and 
predisposition to the toxic effects of cancer treatment with irinotecan (Ando et al. , 2000 

15 and Iyer et al, 2002). Gilbert's syndrome has also been associated with missense coding 
variants in the UGT1A1 gene, in particular in Asian populations where these variants are 
relatively common. Increased risk of breast cancer was reported in African-American 
women who carried the (TA) 7 and (TA)g alleles (Guillemette et al , 2000). In addition to 
the TATA box, Sugatani et al, (2001) identified a region in the UGT1A1 promoter 

20 approximately 3 kb upstream of the TATA box that regulates UGT1A1 inducibility by 
phenobarbital. It is also hypothesized that this phenobarbital-responsive enhancer module 
(PBREM) might be modulated by endogenous factors (Sugatani et al, 2002). UGT1A1 
activity is probably the result of PBREM-dependent modulation of TATA box-dependent 
basal expression. 

25 Polymorphisms in UGT1A1 are relevant to the treatment of cancer patients with 

irinotecan. Irinotecan is a topoisomerase I inhibitor that is approved worldwide for the 
treatment of metastatic colorectal cancer. Irinotecan has a well established role as single 
agent in 5-fluorouracil-refractory patients (Rougier et al, 1998; Cunningham et al, 
1998), as well as in combination with 5-fluorouracil/leucovorin as a first-line therapy 
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(Saltz et al, 2000; Rothenberg et al, 2001). 

Irinotecan hydrolysis by carboxylesterase-2 is responsible for its activation to SN- 
38 (7-ethyl-10-hydroxycamptothecin), a topoisomerase I inhibitor of much higher potency 
than irinotecan. The main inactivating pathway of irinotecan is the biotransformation of 
5 active SN-38 into inactive SN-38 glucuronide (SN-38G). Interpatient differences in 
systemic formation of SN-38G have been shown to have clear clinical consequences in 
patients treated with irinotecan. Patients with higher glucuronidation of SN-38 are more 
likely to be protected from the dose limiting toxicity of diarrhea in the weekly schedule 
(Gupta et al., 1994). SN-38 is glucuronidated by UDP-glucuronosyltransferase 1A1 
10 (UGT1A1) (Iyer etal., 1997). 

Despite its efficacy in treating metastatic colon cancer and its broad spectrum of 
activity in other tumor types, irinotecan treatment is associated with significant toxicity. 
SN-38 is an active metabolite of irinotecan, and SN-38 glucuronidation represents a 
mechanism to protect patients from the toxic effects of irinotecan (Gupta et al, 1994). 

15 Reduced SN-38 glucuronidation is thought to underlie the severe toxicity associated with 
irinotecan treatment in some patients (Gupta et al, 1994). The main severe toxicities of 
irinotecan are delayed diarrhea and myelosuppression. In the early single agent trials, 
grade 3-4 diarrhea occurred in about one third of patients and was dose limiting (Negoro 
et al, 1991; Rothenberg et al, 1993). Its frequency varies from study to study and is also 

20 schedule dependent. The frequency of grade 3-4 diarrhea in the three-weekly regimen 
(19%) is significantly lower compared to the weekly schedule (36%, Fuchs et al, 2003). 
In addition to diarrhea, grade 3-4 neutropenia is also a common adverse event, with about 
30-40% of the patients experiencing it in both weekly and three-weekly regimens (Fuchs 
et al, 2003; Vanhoefer et al, 2001). Fatal events during irinotecan treatment have been 

25 reported. A high mortality rate of 5.3% and 1.6% was reported in the weekly and three- 
weekly single agent irinotecan regimens, respectively (Fuchs et al, 2003). 

Although retrospective analysis of UGT1A1 genetic variation in relation to severe 
toxicity after different irinotecan-based regimens has been conducted in Japanese patients 
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(Ando et at, 2000), prospective evaluation in a large trial has not been performed. 



Thus, the problem of identifying the effects of various promoter polymorphism 
combinations on the expression of UGT1A1 for the determination of UGT activity levels 
remains. Improved methods and compositions for the evaluation of risk for irinotecan 
5 toxicity in an individual or patient are still needed. 

SUMMARY OF THE INVENTION 

Metabolism of SN-38, an active metabolite of irinotecan, via glucuronidation 
represents a mechanism to protect patients from the toxic effects of irinotecan, thus a 

10 reduction in SN-38 glucuronidation contributes to the probability that toxicity associated 
with irinotecan may be experienced in patients. While some genetic basis for reduced 
SN-38 glucuronidation have been identified, other basis have yet to be identified. 
Therefore, there remains a need for improved methods and compositions for evaluating 
polymorphisms in one or both UGT1A1 genes of a patient and correlating a genotype with 

1 5 adverse effects of various therapies. 

The present invention is based on the fact that genetic variation is correlated with 
UGT1A1 expression and has several important clinical implications. The improved 
methods and compositions of the present invention may be used in determining if a 
treatment has a propensity to adversely affect a patient or what treatment may be 

20 appropriate or inappropriate for a particular patient. UGT1A1 basal transcription is 
affected by a polymorphic (TA) repeat (see FIG. 1 legend in Innocenti et at, 2002), in 
addition to a phenobarbital-responsive enhancer module (PBREM) that contains variants 
affecting inducible gene expression, as described herein. A "polymorphism" or "genetic 
polymorphism," as referred to herein, is the existence of two or more variant forms of a 

25 particular characteristic, e.g., a single nucleotide or a repeat a nucleotide or nucleotides. 
Generally, variations are due to the addition, deletion, or substitution of one or more 
nucleotides at a site or a variation in the number of tandem repeats of a DNA sequence. 
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In various embodiments, other polymorphisms within or outside the UGT1 gene locus, 
see Genbank accession number AF297093 which is incorporated herein by reference, may 
be used as long as an association of a polymorphism with a particular phenotype and/or 
haplotype can be established. Exemplary methods for genotyping a UGT1A gene may be 
5 found at least in U.S. Patents 6,479,236, 6,472,157 and 6,395,481, each of which is 
incorporated herein by reference. 

In various embodiments of the invention, significant linkage disequilibrium 
between a (TA) polymorphism and variants in the PBREM, or other variants within or 
outside the UGT1 gene locus, indicates that patients possessing such other variants may 
be at risk of irinotecan toxicity. "Significant" as used in respect to linkage 
disequilibrium, as determined by one of skill in the art, is contemplated to be a statistical 
p or a value that may be 0.25 or 0.1 and may be 0.1, 0.05. 0.001, 0.00001 or less. 
"Linkage disequilibrium" ("LD" as used herein, though also referred to as "LED" in the 
art) refers to a situation where a particular combination of alleles (i.e., a variant form of a 
given gene) or polymorphisms at two loci appears more frequently than would be 
expected by chance. The relationship between PBREM-(TA) n haplotypes and the 
glucuronidation rate of the UGT1A1 substrate SN-38 may be used to correlate the 
genotype (i.e., the genetic make up of an organism) to a phentoype (i.e., the physical traits 
displayed by an organism or cell). "Haplotype" is used herein to refer to a collective 
genotype of two or more closely linked loci. Each haplotype defines the sequence of 
alleles or polymorphisms along one of the homologous chromosomes. In some 
embodiments, the polymorphisms may be 0.001, 0.01, 0.1, 0.2 cM or more within one 
another. 

Various embodiments of the invention include methods for evaluating the risk of 
25 toxicity from irinotecan, or other UGT1 Al substrates, in a patient. A polymorphism may 
be a single nucleotide polymorphism (SNP) and may be in linkage disequilibrium with a 
(TA) n repeat. In certain embodiments, the methods include detecting one or more 
polymorphisms in one or both copies of the UGT1A1 gene and/or one or both copies of 
any other gene located in the UGT1 gene locus of a patient. In particular embodiments a 
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15 



promoter polymorphism is detected. It is specifically contemplated that methods and 
compositions of the invention may be implemented to determine whether UGT1A1 
polymorphisms are present or absent in one or both alleles. 

In certain embodiments, a polymorphism may be a polymorphism that affects the 
5 transcription of UGT1A1, such as in the promoter region or 5' flanking region that affects 
transcription (which includes the promoter region), and in particular a polymorphism at 
nucleotide position -3440, -3401, -3279, -3177, -3175, or -3156 from the UGT1A1 gene 
transcriptional start site, which is designated +1 with no nucleotide designated as 0. The 
number of TA repeats can be 5, 6, 7, 8 or more TA repeats. In particular embodiments, 
10 the polymorphism is the following: -3440OA, -3401T>C, -3279G>T, -31770G, - 
3175A>G, -3156G>A, or any combination thereof. The notation -3440OA, for example 
indicates that cytosine nucleotide (C) at the -3440 position is replaced by an Adenosine 
(A). 

Methods of the invention may include obtaining a nucleic acid sample from a 
15 patient and detecting one or more polymorphisms in the UGT1A1 gene using various 
methods. In certain embodiments, polymorphism detection may include amplifying a 
nucleic acid containing all or part of a particular region of the UGT1A1 gene to obtain 
amplification products; and/or analyzing the amplification products for the presence or 
absence of one or more polymorphisms. Other methods of polymorphism detection 
20 known in the art are also contemplated. 

In certain embodiments, a promoter polymorphism of a UGT1A1 gene may be 
detected by performing one of a variety of known assays. These may include, but are not 
limited to hybridization assays, sequencing or microsequencing assays, allele-specific 
amplification assays or any other methods known for detecting nucleic acid 
25 polymorphisms, which may or may not include amplification of a nucleic acid. It is 
understood that "detecting" a polymorphism includes identifying the nucleotide sequence 
at that site and/or determining whether the polymorphism is present or absent. 
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A correlation between one or more polymorphisms and the glucoronidation rate of 
irinotecan or other substrates of UGT1A1, including but not limited to bilirubin, estriol, 
beta-estradiol, 2-hydroxyestriol, 2-hydroxyestrone, 2-hydroxyestradiol, thyroxine (T4), 
rT3, octyl gallate, propyl gallate, anthraflavic acid, quercitin, fisetin, naringenin, 1- 
5 naphtol, and ethynylestradiol, may be used to determine various aspects of a treatment 
regime, including irinotecan and/or other drugs or compounds metabolized directly or 
indirectly by UGT1A1. In some embodiments the methods also include analyzing the 
glucuronidation rate associated with the various polymorphisms and polymorphism 
combinations, for exemplary methods and compositions related to analysis of 
10 glucuronidation rates see U.S. Patent 6,319,678, which is incorporated herein by 
reference. The methods may also include determining the biliary transport capacity of the 
patient. In particular embodiments the evaluation of the promoter polymorphism may be 
used to optimize the dose of irinotecan or other compounds for treatment of a patient or 
to reduce their toxicity. 

15 The methods of the invention may further include treating a patient by 

administering to the patient irinotecan in combination with other pharmaceutical agents at 
appropriate dosages, such that the toxicity of irinotecan or other substrates of UGT1A1 
are reduced. In particular embodiments, a second agent that reduces excretion of an 
active irinotecan species through the bile may administered in conjunction with irinotecan 

20 based upon determinations made using methods and compositions of the invention, for 
related methods and compositions see U.S. Patents 6,407,117, 6,287,834 and 5,786,344, 
each of which is incorporated herein by reference. 

The present invention is also based on the observation that the nucleotide at 
position -3156 in the UGT1A1 upstream region is correlated with irinotecan toxicity. An 
25 A at that position positively correlates with irinotecan toxicity while a G at that position 
correlates with tolerance to irinotecan. Thus, the present invention concerns methods and 
compositions for evaluating, predicting, and determining whether a patient will 
experience toxicity from irinotecan. Toxicity from irinotecan evidences itself as side 
effects from the drug, which are well known to oncologists and their patients. 
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In some embodiments of the invention, there are methods of predicting whether a 
patient may suffer or be subject to toxicity from irinotecan if given it involving 
determining the nucleic acid sequence of base -3156 in the UGT1A1 5' flanking region in 
one or both alleles of the patient. The presence of an A nucleotide indicates the person is 
5 at risk for irinotecan toxicity. An AA genotype is more closely correlated with grade 4 
neutropenia than other genotypes at that position. Moreover, in some embodiments, this 
is unrelated to the genotype of the TA indel in the UGT1A1 promoter. It is contemplated 
that these methods concerning the indel at position -3156 in the UGT1A1 5' flanking 
region can be implemented with methods involving determining one or more other 
10 polymorphisms in the UGT1A1 5' flanking region of the same patient. 

Consequently, if a person is identified as at risk for irinotecan toxicity based on 
any of the embodiments discussed herein, an alternative course of therapy or a lower dose 
of irinotecan than is normally given may be contemplated. In addition, methods also 
include determining the sequence of other polymorphisms or indels (insertion/deletions) 
15 in linkage disequilibrium (LD) with the -3156 variant. Therefore, in some embodiments 
of the invention, the TA indel is evaluated to determine the number of repeats. Also, any 
other variant in UGT1 Al or any other gene (the term "gene" includes non-coding regions 
that affect the expression or activity level of the encoded polypeptide) may be evaluated 
for variants in LD with the -3156 variant. 

20 Various embodiments may include a kit for evaluating the risk of irinotecan 

toxicity in a patient. The kit may include a variety of containers, reagents and the like. In 
certain embodiments, the kit may include an oligonucleotide primer to amplify a 
promoter region of a UGT1A1 gene or genes, haplotype tag SNPs or allele specific 
amplification primers of the UGT1A1 gene or any other primer within the UGT1 gene 

25 locus. The haplotype tag SNPs or allele specific primers may be used to amplify a 
polymorphism at one or more nucleotide positions of the UGT1A1 gene or other UGT1 
locus gene. In particular embodiments, the nucleotide position may be at -3440, -3401, - 
3279, -3177, -3175, or -3156, or a combination thereof, from the UGT1A1 gene 
transcriptional start site. The kit may include the haplotype tag SNPs or allele specific 
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amplification primers in multi-well assay plate. The kit may also include haplotype tag 
SNPs or allele specific hybridization probes for a variety of promoter polymorphisms. 
The haplotype tag SNPs or allele specific hybridization probes may detect polymorphisms 
at nucleotide position -3440, -3401, -3279, -3177, -3175, or -3156 from the UGT1A1 
5 gene transcriptional start site. The kit may include haplotype tag SNPs or allele specific 
hybridization probes comprised in an oligonucleotide array or microarray. 

Compositions of the invention include nucleic acids that can be used to determine 
the sequence at position -3156 of UGT1A or other reagents in that regard. Arrays and 
other assays for screening multiple samples are also included as part of the invention. 
1 0 Such compositions may be incorporated into kits or as part of a kit, along with any other 
composition discussed herein. 

It is contemplated that any method or composition described herein can be 
implemented with respect to any other method or composition described herein. 
Similarly, any embodiment discussed with respect to one aspect of the invention may be 
1 5 used in the context of any other aspect of the invention. 

Throughout this application, the term "about" is used to indicate that a value 
includes the standard deviation of error for the device or method being employed to 
determine the value. 

The use of the word "a" or "an" when used in conjunction with the term 
20 "comprising" in the claims and/or the specification may mean "one," but it is also 
consistent with the meaning of "one or more," "at least one," and "one or more than one." 

Other objects, features and advantages of the present invention will become 
apparent from the following detailed description. It should be understood, however, that 
the detailed description and the specific examples, while indicating specific embodiments 
25 of the invention, are given by way of illustration only, since various changes and 
modifications within the spirit and scope of the invention will become apparent to those 
skilled in the art from this detailed description. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



The following drawings form part of the present specification and are included to 
further demonstrate certain aspects of the present invention. The invention may be better 
5 understood by reference to one or more of these drawings in combination with the 
detailed description of specific embodiments presented herein. 

FIG. 1 illustrates an exemplary phenobarbital-responsive enhancer module 
(PBREM) and description of polymorphisms (SEQ ID NO: 14). The previously described 
PBREM domains are underlined with the NR half-site sequences shown in bold. The 
10 polymorphic sites of the present application are included. The variants found at these 
sites are also listed. Positions indicated are from the first base of the UGT1A cluster 
sequence (Genbank accession No. AF297093) (SEQ ID NO:l). 

FIG. 2A-2D illustrates (TA)„ genotype-phenotype relationship in human livers, 
(a) correlation in all samples investigated (n=83) (b) correlation in Caucasians (n=56) (c) 
15 correlation in African- Americans (n=15) (d) correlation in individuals of Asian (n=l), 
and unknown (n=10) ethnicity. Liver microsomes were phenotyped for SN-38 
glucuronidation rates in each liver with a single experiment performed in triplicate. Bars 
show the mean value of SN-38 glucuronidation rates in each group. 

FIG. 3 illustrates an exemplary haplotype-phenotype relationship in human livers 
20 of Caucasian and African origin (n=70). Bars show the mean value of SN-38 
glucuronidation rates in each group. Only haplotypes with > 2 samples are shown. 

FIG. 4. Correlation between ANC and TA indel genotype. Bars represent the 
means. Nonparametric trend analysis (7/7<6/7<6/6, z = -2.72, p=0.01). 

FIG. 5. Pre-treatment total bilirubin levels and distribution of the -3156 
25 genotypes within each TA indel genotype. The -3156 AA genotypes are reported in 
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squares, the GA genotypes in circles and the GG genotypes in triangles. Bars represent 
the mean values. A significant trend was reported (7/7>6/7>6/6, z=2.88, p<0.01, 
nonparametric trend analysis). 

FIG. 6. Correlation between ln(ANC nadir) and pretreatment total bilirubin 
5 levels. Patients with bilirubin levels less than 0.6 mg/dl are depicted in squares. Those 
with bilirubin levels higher than 0.7 mg/dl are depicted in circles. 

DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

The present invention provides improved methods and compositions for 
10 identifying the effects of various polymorphisms, promoter polymorphisms or any 
combination thereof on the expression of UGT1A1 or the glucuronidation rate of 
UGT1A1 for the evaluation of the potential or risk for irinotecan toxicity in an individual 
or patient. The development of these improved methods and compositions allows for the 
use of such an evaluation to optimize treatment of a patient and to lower the risk of 
15 toxicity. In certain aspects of the invention various combinations of promoter 
polymorphisms may be used in this evaluation, in particular, polymorphisms in the 
PBREM region and polymorphisms in the TA repeats may be used. 

Genetic variation in UGT1A1 expression has several important clinical 
implications. UGT1A1 basal transcription is affected by a polymorphic (TA) repeat. 

20 Another important regulatory element is the phenobarbital-responsive enhancer module 
(PBREM) which may contain variants affecting inducible gene expression. The 
examples provided herein study the extent of linkage disequilibrium between the (TA) 
polymorphism and variants in the PBREM and UGT1A1 promoter. The relationship 
between PBREM-(TA) n haplotypes and the glucuronidation rate of the UGT1A1 substrate 

25 SN-38 is also addressed herein. Studies described in the Examples illustrate that SN-38G 
formation rates were correlated with (TA) genotypes and promoter variants. In vaious 
aspects particular (TA) variants are in linkage disequilibrium with various other 
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polymorphisms. 

Certain aspects of the invention are based on, but not limited to, the observation 
and characterization of novel polymorphisms in the PBREM region of the UGT1A1 gene. 
Due to the clinical implications of genetically modified regulation of UGT1A1 activity, 
5 the PBREM region was sequenced and polymorphisms in the TATA box of the UGT1A1 
promoter genotyped, as described in the Examples section below. 

I. HEPATIC GLUCURONIDATION BY UGT ENZYMES 

Hepatic glucuronidation results from the activities of a multigene family of UGT 
enzymes, the members of which exhibit specificity for a variety of endogenous substrates 

10 and xenobiotics. The UGT enzymes are broadly classified into two distinct gene families. 
The UGT1 locus codes for multiple isoforms of UGT, all of which share a C-terminus 
encoded by a unique set of exons 2-5, but which have a variable N-terminus encoded by 
different first exons, each with its own independent promoter (Bosma et al, 1992; Ritter 
et al, 1992). The variable first exons confer the substrate specificity of the enzyme. 

1 5 Isoforms of the UGT2 family are unique gene products of which at least eight isozymes 
have been identified (Clarke et al. Handbook of Experimental Pharmacology 1994). The 
UGT1A1 isoform is the major bilirubin glucuronidation enzyme. Genetic defects in the 
UGT1A1 gene can result in decreased glucuronidation activity which leads to abnormally 
high levels of unconjugated serum bilirubin that may enter the brain and cause 

20 encephalopathy and kernicterus (Owens & Ritter, 1995). This condition is commonly 
known as Gilbert's syndrome. The molecular defect in Gilbert's Syndrome is a change in 
the TATA box within the UGT1A1 promoter (Bosma et al, 1995 and Monaghan et al, 
1996). This promoter usually contains a (TA) 6 TAA element, but another allele, termed 
UGT1A1*28 or allele 7, is also present in human populations at high frequencies, and 

25 contains the sequence (TA) 7 TAA. This polymorphism in the promoter of the UGT1A1 
gene results in reduced expression of the gene and accounts for most cases of Gilbert's 
Syndrome (Bosma et al, 1995). Overall, gene expression levels for the UGT1A1 
promoter alleles are inversely related to the length of the TA repeat in the TATA box. 

55123561.1 

-16- 



The variation observed in this promoter may also account for the inter-individual 
and inter-ethnic variation in drug metabolism and response to xenobiotic exposure. 
UGTs have been shown to contribute to the detoxification and elimination of both 
exogenous and endogenous compounds. For example, one typical role of the UGT1A1 
5 isoform is the glucuronidation of SN-38 (7-ethyl-10-hydroxycamptothecin) to the 
corresponding glucuronide (10-O-glucuronyl-SN-38, SN-38G) as well as the 
glucuronidation of TAS-103 (6-[[2-(dimethylamino)ethyl]amino]-3-hydroxy-7H- 
indeno[2,l-c]quinoline-7-one dihydrochloride) to its corresponding glucuronide (TAS- 
103G). SN-38 is the active form of irinotecan (CPT-11, 7-ethyl-10-[4-(l-piperidino)-l- 

10 piperidino]carbonyloxycamptothecin), which is a camptothecin derivative used in the 
treatment of metastatic colorectal cancer and other malignancies. The metabolism of SN- 
38 and TAS-103 (also known as flavopiridol) are merely illustrative of the present 
invention the metabolism of other UGT1A1 substrates is also contemplated, such as 
estradiol, bilirubin, simple phenols, flavones, CI 8 steroids, complex phenols and 

1 5 coumarins. 

Irinotecan is biotransformed by tissue and serum carboxylesterases to an active 
metabolite, SN-38, which has a 100-1,000-fold higher antitumor activity than irinotecan. 
SN-38 is glucuronidated by hepatic uridine diphosphate glucuronosyltransferases (UGTs) 
to form SN-38 glucuronide (10O-glucuronyl-SN-38, SN-38G), which is inactive and 
20 excreted into the bile and urine although, SN-38G might be deconjugated to form SN-38 
by intestinal p -glucuronidase enzyme (Kaneda et al, 1990). 

The major dose-limiting toxicities of irinotecan include diarrhea and, to a lesser 
extent, myelosuppression. irinotecan-induced diarrhea can be serious and often does not 
respond adequately to conventional antidiarrheal agents (Takasuna et al, 1995). This 
25 diarrhea may be due to direct enteric injury caused by the active metabolite, SN-38, which 
has been shown to accumulate in the intestine after intra peritoneal administration of 
irinotecan in athymic mice (Araki et al, 1993). The results of a recently completed phase 
I clinical trial, demonstrated that there was an inverse relationship between SN-38 
glucuronidation rates and severity of diarrheal incidences in patients treated with 
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increasing doses of Irinotecan (Gupta et al, 1994). These findings indicate that 
glucuronidation of SN-38 protects against Irinotecan-induced gastrointestinal toxicity. A 
complete discussion of the correlation between diarrhea and SN-38 glucuronidation, as 
well as a description of biochemical methods for determining glucurondation levels can 
5 be found in US Patent 5,786,344 and W096/01 127 which are both incorporated herein by 
reference in their entirety. Likewise, the results of studies using TAS-103 demonstrate 
that glucuronidation of TAS-103 may protect against TAS-103 induced toxicity. 
Therefore, the conversion of these two toxic compounds by hepatic UGTs demonstrates 
the importance of monitoring glucuronidation activity as an indicator of susceptibility to 
10 toxicity caused by exposure to compounds that are metabolized by UGTs. Furthermore, 
differential rates of SN-38 glucuronidation among subjects may explain the considerable 
inter-individual variation in the pharmacokinetic parameter estimates and toxicities 
observed after treatment with anti-cancer drugs or exposure to xenobiotics (Gupta et al, 
1994; Gupta et al, 1997). 

15 When two species, Gunn rats (Gunn, 1938) and CN-1 patients, that are deficient 

in UGT1A isoforms were screened for TAS-103 and SN-38 glucuronidation activity, 
there was approximately an 80% lower glucuronidation rate of TAS-103 in vitro and no 
in vitro glucuronidation of SN-38 compared to healthy liver donors. These results 
demonstrate the role of the UGT 1 family in catalyzing SN-38 and TAS-103 conjugation. 

20 Furthermore, these results demonstrate that the UGT2 family does not play a role in the 
glucuronidation of SN-38. On the other hand, while isoforms of the UGT1 family are the 
predominate isoforms involved in TAS-103 glucuronidation, the isoform of the UGT2 
family may also participate in TAS-103 glucuronidation. Failure to glucuronidate SN-38 
and TAS-103 in these instances may result specifically from the genetic defect in UGT1 

25 gene family. 

Other experiments confirm the association between the UGT1A1 isoform and SN- 
38 and TAS-103 glucuronidation. These studies show that substantial genetic variability 
exists in the UGT1A1 isoform family and particularly in the UGT1A1 promoter. This 
genetic variability has been shown to correlate with gene expression. For example, the 
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presence of the 5 allele in the UGT1A1 promoter leads to increased gene expression while 
the presence of the 8 allele leads to reduced gene expression. Differences in gene 
expression levels may give rise to individuals with varying abilities to glucuronidate 
compounds metabolized by UGTs. This prediction was confirmed through a correlation 
5 analysis of UGT1A1 promoter genotype and rate of in vitro SN-38 and TAS-103 
glucuronidation. 

It follows therefore that individuals with the 8 allele may also have differing 
susceptibility to xenobiotics when compared to other genotypes when those compounds 
are metabolized by UGTlAls. On the other hand, the presence of the 5 allele that 
correlates with increased gene expression and higher glucuronidation activity may result 
in the administration of less than optimum drug dosages. For example, when a drug 
metabolized by UGTlAls is administered to an individual with this polymorphism, the 
increased glucuronidation activity may cause more of the drug to be converted into the 
inactive metabolite in a shorter period, thereby reducing the drug's effectiveness. 
Conversely, in the rare case of drugs and xenobiotics that require glucuronidation for 
activation, decreased glucuronidation activity may cause less of the activated form of the 
drug or xenobiotic to be available. 

The fact that repeated sequences are intrinsically unstable and tend to lengthen 
and shorten as a result of unequal crossing-over during meiosis may explain the presence 
20 of other alleles, in addition to (TA)6 and (TA)7, in the population. Two additional alleles 
have been identified in human populations: allele 5, containing the sequence (TA)s TAA 
and allele 8, containing the sequence (TA)g TAA, see U.S. Patent 6,395,481, which is 
incorporated in its entirety by reference. Interestingly, alleles 5 and 8 were found 
predominantly in population samples from Sub-Saharan Africa, where they occur at lower 
25 frequencies than the common alleles 6 and 7 although it is possible that these two alleles 
are present across a variety of ethnic groups. The frequency of alleles 6 and 7 also 
appears to differ significantly across ethnic groups, with Asian and Amerindian 
populations showing the highest frequencies of allele 6. Conversely, alleles 6 and 7 occur 
at intermediate and similar frequencies among Caucasians and Sub-Saharan Africans. 
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Several hypotheses may be proposed regarding the selective pressures that might 
be responsible for the observed pattern of inter-population variation at the UGT1A1 
promoter. It was previously proposed that the maintenance of intermediate levels of 
bilirubin is adaptive (Beutler et al, 1998), and that the alleles at this promoter would be 

5 maintained in the population by balancing selection. This hypothesis is based on the 
observation that bilirubin is a potent antioxidant likely to have physiological significance 
(Stocker et al, 1987). However, it is also known that glucuronidation is an important 
detoxification step for many endogenous as well as exogenous compounds (Clarke & 
Burchell, 1994). In addition to TAS-103 and SN-38, UGT1A1 is likely to act on other 

10 substrates present in the environment, e.g., dietary components, environmental pollutants 
and carcinogens, which require detoxification as well as playing a role in the metabolism 
of bilirubin and other endogenous compounds. Within this framework, maintaining high 
levels of UGT1A1 gene expression would ensure rapid elimination of toxic or 
endogenous compounds and be advantageous. 

15 As described herein, the correlation between in vitro glucuronidation rate and 

UGT1A1 promoter polymorphism found for alleles 6 and 7 has been shown to extend to 
alleles 5 and 8. Because these alleles appear to be more frequent in subsets of human 
populations (for example, those of African origin), an even higher inter-individual 
variability in SN-38 and TAS-103 metabolism might be expected within these 

20 populations. Because the inverse relationship between TA repeat size and rate of SN-38 
glucuronidation extends to alleles 5 and 8, a screening assay that identifies these alleles 
can facilitate individualization of drug therapy, identify individuals susceptible to 
xenobiotic exposure, and can be used to improve drug dosage calculations. 

II. NUCLEIC ACIDS 

25 Certain embodiments of the present invention concern various nucleic acids, 

including promoters, amplification primers, oligonucleotide probes and other nucleic acid 
elements involved in the analysis of genomic DNA. In certain aspects, a nucleic acid 
comprises a wild-type, a mutant, or a polymorphic nucleic acid. 
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The term "nucleic acid" is well known in the art. A "nucleic acid" as used herein 
will generally refer to a molecule (i.e., a strand) of DNA, RNA or a derivative or analog 
thereof, comprising a nucleobase. A nucleobase includes, for example, a naturally 
occurring purine or pyrimidine base found in DNA (e.g., an adenine "A," a guanine "G," a 
5 thymine "T" or a cytosine "C") or RNA (e.g., an A, a G, an uracil "U" or a C). The term 
"nucleic acid" encompass the terms "oligonucleotide" and "polynucleotide," each as a 
subgenus of the term "nucleic acid." The term "oligonucleotide" refers to a molecule of 
between about 3 and about 100 nucleobases in length. The term "polynucleotide" refers 
to at least one molecule of greater than about 100 nucleobases in length. A "gene" refers 
10 to coding sequence of a gene product, as well as introns and the promoter of the gene 
product. In addition to the UGT1A1 gene, other regulatory regions such as enhancers for 
UGT1A1 are contemplated as nucleic acids for use with compositions and methods of the 
claimed invention. 

These definitions generally refer to a single-stranded molecule, but in specific 
embodiments will also encompass an additional strand that is partially, substantially or 
fully complementary to the single-stranded molecule. Thus, a nucleic acid may 
encompass a double-stranded molecule or a triple-stranded molecule that comprises one 
or more complementary strand(s) or "complement(s)" of a particular sequence comprising 
a molecule. As used herein, a single stranded nucleic acid may be denoted by the prefix 
"ss", a double stranded nucleic acid by the prefix "ds", and a triple stranded nucleic acid 
by the prefix "ts." 

In particular aspects, a nucleic acid encodes a protein, polypeptide, or peptide. In 
certain embodiments, the present invention concerns novel compositions comprising at 
least one proteinaceous molecule. As used herein, a "proteinaceous molecule," 
25 "proteinaceous composition," "proteinaceous compound," "proteinaceous chain," or 
"proteinaceous material" generally refers, but is not limited to, a protein of greater than 
about 200 amino acids or the full length endogenous sequence translated from a gene; a 
polypeptide of greater than about 100 amino acids; and/or a peptide of from about 3 to 
about 100 amino acids. All the "proteinaceous" terms described above may be used 
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interchangeably herein. 

1. Preparation of Nucleic Acids 

A nucleic acid may be made by any technique known to one of ordinary skill in 
the art, such as for example, chemical synthesis, enzymatic production or biological 
5 production. Non-limiting examples of a synthetic nucleic acid (e.g., a synthetic 
oligonucleotide), include a nucleic acid made by in vitro chemical synthesis using 
phosphotriester, phosphite or phosphoramidite chemistry and solid phase techniques such 
as described in European Patent 266,032, incorporated herein by reference, or via 
deoxynucleoside H-phosphonate intermediates as described by Froehlere/ al., 1986 and 
10 U.S. Patent 5,705,629, each incorporated herein by reference. In the methods of the 
present invention, one or more oligonucleotide may be used. Various different 
mechanisms of oligonucleotide synthesis have been disclosed in for example, U.S. 
Patents 4,659,774, 4,816,571, 5,141,813, 5,264,566, 4,959,463, 5,428,148, 5,554,744, 
5,574,146, 5,602,244, each of which is incorporated herein by reference. 

15 A non-limiting example of an enzymatically produced nucleic acid include one 

produced by enzymes in amplification reactions such as PCR™ (see for example, U.S. 
Patent 4,683,202 and U.S. Patent 4,682,195, each incorporated herein by reference), or 
the synthesis of an oligonucleotide described in U.S. Patent 5,645,897, incorporated 
herein by reference. A non-limiting example of a biologically produced nucleic acid 

20 includes a recombinant nucleic acid produced (i.e., replicated) in a living cell, such as a 
recombinant DNA vector replicated in bacteria (see for example, Sambrook et al. 2001, 
incorporated herein by reference). 

2. Purification of Nucleic Acids 

A nucleic acid may be purified on polyacrylamide gels, cesium chloride 
25 centrifugation gradients, chromatography columns or by any other means known to one of 
ordinary skill in the art (see for example, Sambrook et al, 2001, incorporated herein by 
reference). In some aspects, a nucleic acid is a pharmacologically acceptable nucleic 
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acid. Pharmacologically acceptable compositions are known to those of skill in the art, 
and are described herein. 

In certain aspects, the present invention concerns a nucleic acid that is an isolated 
nucleic acid. As used herein, the term "isolated nucleic acid" refers to a nucleic acid 
5 molecule (e.g., an RNA or DNA molecule) that has been isolated free of, or is otherwise 
free of, the bulk of the total genomic and transcribed nucleic acids of one or more cells. 
In certain embodiments, "isolated nucleic acid" refers to a nucleic acid that has been 
isolated free of, or is otherwise free of, bulk of cellular components or in vitro reaction 
components such as for example, macromolecules such as lipids or proteins, small 
10 biological molecules, and the like. 

3. Nucleic Acid Segments 

In certain embodiments, the nucleic acid is a nucleic acid segment. As used 
herein, the term "nucleic acid segment," are fragments of a nucleic acid, such as, for a 
non-limiting example, those that encode only part of a UGT1 gene locus or a UGT1A1 
15 gene sequence. Thus, a "nucleic acid segment" may comprise any part of a gene 
sequence, including from about 2 nucleotides to the full length gene including promoter 
regions to the polyadenylation signal and any length that includes all the coding region. 

Various nucleic acid segments may be designed based on a particular nucleic acid 
sequence, and may be of any length. By assigning numeric values to a sequence, for 
20 example, the first residue is 1, the second residue is 2, etc., an algorithm defining all 
nucleic acid segments can be created: 

n to n + y 

where n is an integer from 1 to the last number of the sequence and y is the length of the 
nucleic acid segment minus one, where n + y does not exceed the last number of the 
25 sequence. Thus, for a 10-mer, the nucleic acid segments correspond to bases 1 to 10, 2 to 
1 1, 3 to 12 ... and so on. For a 15-mer, the nucleic acid segments correspond to bases 1 to 
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15, 2 to 16, 3 to 17 ... and so on. For a 20-mer, the nucleic segments correspond to bases 
1 to 20, 2 to 21, 3 to 22 ... and so on. In certain embodiments, the nucleic acid segment 
may be a probe or primer. As used herein, a "probe" generally refers to a nucleic acid 
used in a detection method or composition. As used herein, a "primer" generally refers to 
5 a nucleic acid used in an extension or amplification method or composition. 

4. Nucleic Acid Complements 

The present invention also encompasses a nucleic acid that is complementary to a 
nucleic acid. A nucleic acid is "complement(s)" or is "complementary" to another nucleic 
acid when it is capable of base-pairing with another nucleic acid according to the standard 
10 Watson-Crick, Hoogsteen or reverse Hoogsteen binding complementarity rules. As used 
herein "another nucleic acid" may refer to a separate molecule or a spatial separated 
sequence of the same molecule. In preferred embodiments, a complement is a 
hybridization probe or amplification primer for the detection of a nucleic acid 
polymorphism. 

15 As used herein, the term "complementary" or "complement" also refers to a 

nucleic acid comprising a sequence of consecutive nucleobases or semiconsecutive 
nucleobases (e.g., one or more nucleobase moieties are not present in the molecule) 
capable of hybridizing to another nucleic acid strand or duplex even if less than all the 
nucleobases do not base pair with a counterpart nucleobase. However, in some diagnostic 

20 or detection embodiments, completely complementary nucleic acids are preferred. 

III. NUCLEIC ACID DETECTION 

Some embodiments of the invention concern identifying polymorphisms in 
UGT1A1, correlating genotype or haplotype to phenotype, wherein the phenotype is 
25 lowered or altered UGT1A1 activity or expression, and then identifying such 
polymorphisms in patients who have or will be given irinotecan or related drugs or 
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compounds. Thus, the present invention involves assays for identifying polymorphisms 
and other nucleic acid detection methods. Nucleic acids, therefore, have utility as probes 
or primers for embodiments involving nucleic acid hybridization. They may be used in 
diagnostic or screening methods of the present invention. Detection of nucleic acids 
5 encoding UGT1A1, as well as nucleic acids involved in the expression or stability of 
UGT1A1 polypeptides or transcripts, are encompassed by the invention. General 
methods of nucleic acid detection methods are provided below, followed by specific 
examples employed for the identification of polymorphisms, including single nucleotide 
polymorphisms (SNPs). 

10 A. Hybridization 

The use of a probe or primer of between 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, or 15 
and 50, 60, 70, 80, 90, or 100 nucleotides, preferably between 17 and 100 nucleotides in 
length, or in some aspects of the invention up to 1-2 kilobases or more in length, allows 
the formation of a duplex molecule that is both stable and selective. Molecules having 
complementary sequences over contiguous stretches greater than 20 bases in length are 
generally preferred, to increase stability and/or selectivity of the hybrid molecules 
obtained. One will generally prefer to design nucleic acid molecules for hybridization 
having one or more complementary sequences of 20 to 30 nucleotides, or even longer 
where desired. Such fragments may be readily prepared, for example, by directly 
synthesizing the fragment by chemical means or by introducing selected sequences into 
recombinant vectors for recombinant production. 

Accordingly, the nucleotide sequences of the invention may be used for their 
ability to selectively form duplex molecules with complementary stretches of DNAs 
and/or RNAs or to provide primers for amplification of DNA or RNA from samples. 
25 Depending on the application envisioned, one would desire to employ varying conditions 
of hybridization to achieve varying degrees of selectivity of the probe or primers for the 
target sequence. 

For applications requiring high selectivity, one will typically desire to employ 
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relatively high stringency conditions to form the hybrids. For example, relatively low salt 
and/or high temperature conditions, such as provided by about 0.02 M to about 0.10 M 
NaCl at temperatures of about 50°C to about 70°C. Such high stringency conditions 
tolerate little, if any, mismatch between the probe or primers and the template or target 
5 strand and would be particularly suitable for isolating specific genes or for detecting a 
specific polymorphism. It is generally appreciated that conditions can be rendered more 
stringent by the addition of increasing amounts of formamide. For example, under highly 
stringent conditions, hybridization to filter-bound DNA may be carried out in 0.5 M 
NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and washing in 0.1 x 
10 SSC/0.1% SDS at 68°C (Ausubel et al, 1989). 

Conditions may be rendered less stringent by increasing salt concentration and/or 
decreasing temperature. For example, a medium stringency condition could be provided 
by about 0.1 to 0.25M NaCl at temperatures of about 37°C to about 55°C, while a low 
stringency condition could be provided by about 0.1 5M to about 0.9M salt, at 
15 temperatures ranging from about 20°C to about 55°C. Under low stringent conditions, 
such as moderately stringent conditions the washing may be carried out for example in 
0.2 x SSC/0.1% SDS at 42°C (Ausubel et al, 1989). Hybridization conditions can be 
readily manipulated depending on the desired results. 

In other embodiments, hybridization may be achieved under conditions of, for 
20 example, 50mM Tris-HCl (pH 8.3), 75mM KC1, 3mM MgCl 2 , l.OmM dithiothreitol, at 
temperatures between approximately 20°C to about 37°C. Other hybridization conditions 
utilized could include approximately lOmM Tris-HCl (pH 8.3), 50mM KC1, 1.5mM 
MgCl2, at temperatures ranging from approximately 40°C to about 72°C. 

In certain embodiments, it will be advantageous to employ nucleic acids of 
25 defined sequences of the present invention in combination with an appropriate means, 
such as a label, for determining hybridization. A wide variety of appropriate indicator 
means are known in the art, including fluorescent, radioactive, enzymatic or other ligands, 
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such as avidin/biotin, which are capable of being detected. In preferred embodiments, 
one may desire to employ a fluorescent label or an enzyme tag such as urease, alkaline 
phosphatase or peroxidase, instead of radioactive or other environmentally undesirable 
reagents. In the case of enzyme tags, colorimetric indicator substrates are known that can 
5 be employed to provide a detection means that is visibly or spectrophotometrically 
detectable, to identify specific hybridization with complementary nucleic acid containing 
samples. In other aspects, a particular nuclease cleavage site may be present and 
detection of a particular nucleotide sequence can be determined by the presence or 
absence of nucleic acid cleavage. 

10 In general, it is envisioned that the probes or primers described herein will be 

useful as reagents in solution hybridization, as in PCR, for detection of expression or 
genotype of corresponding genes, as well as in embodiments employing a solid phase. In 
embodiments involving a solid phase, the test DNA (or RNA) is adsorbed or otherwise 
affixed to a selected matrix or surface. This fixed, single-stranded nucleic acid is then 

15 subjected to hybridization with selected probes under desired conditions. The conditions 
selected will depend on the particular circumstances (depending, for example, on the 
G+C content, type of target nucleic acid, source of nucleic acid, size of hybridization 
probe, etc.). Optimization of hybridization conditions for the particular application of 
interest is well known to those of skill in the art. After washing of the hybridized 

20 molecules to remove non-specifically bound probe molecules, hybridization is detected, 
and/or quantified, by determining the amount of bound label. Representative solid phase 
hybridization methods are disclosed in U.S. Patents 5,843,663, 5,900,481 and 5,919,626. 
Other methods of hybridization that may be used in the practice of the present invention 
are disclosed in U.S. Patents 5,849,481, 5,849,486 and 5,851,772. The relevant portions 

25 of these and other references identified in this section of the Specification are 
incorporated herein by reference. 

B. Amplification of Nucleic Acids 

Nucleic acids used as a template for amplification may be isolated from cells, 
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tissues or other samples according to standard methodologies (Sambrook et al, 2001). In 
certain embodiments, analysis is performed on whole cell or tissue homogenates or 
biological fluid samples with or without substantial purification of the template nucleic 
acid. The nucleic acid may be genomic DNA or fractionated or whole cell RNA. Where 
5 RNA is used, it may be desired to first convert the RNA to a complementary DNA. 

The term "primer," as used herein, is meant to encompass any nucleic acid that is 
capable of priming the synthesis of a nascent nucleic acid in a template-dependent 
process. Typically, primers are oligonucleotides from ten to twenty and/or thirty base 
pairs in length, but longer sequences can be employed. Primers may be provided in 
10 double-stranded and/or single-stranded form, although the single-stranded form is 
preferred. 

Pairs of primers designed to selectively hybridize to nucleic acids corresponding 
to the UGT1 gene locus (Genbank accession AF297093), UGT1A1 gene and/or SEQ ID 
NO:l or variants thereof, and fragments thereof are contacted with the template nucleic 

15 acid under conditions that permit selective hybridization. SEQ ID NO:l set forth a 
nucleotide sequence that includes a majority of the UGT1A1 gene. SEQ ID NO:l 
includes nucleotides 169,831 to 187,313 of the UGT1 gene locus with nucleotide 1645 of 
SEQ ID NO:l corresponding to nucleotide -3565 from the transcriptional start of the 
UGT1A1 gene, thus the transcriptional start is located at nucleotide 5212 of SEQ ID 

20 NO:l. Depending upon the desired application, high stringency hybridization conditions 
may be selected that will only allow hybridization to sequences that are completely 
complementary to the primers. In other embodiments, hybridization may occur under 
reduced stringency to allow for amplification of nucleic acids that contain one or more 
mismatches with the primer sequences. Once hybridized, the template-primer complex is 

25 contacted with one or more enzymes that facilitate template-dependent nucleic acid 
synthesis. Multiple rounds of amplification, also referred to as "cycles," are conducted 
until a sufficient amount of amplification product is produced. 

The amplification product may be detected, analyzed or quantified. In certain 
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applications, the detection may be performed by visual means. In certain applications, the 
detection may involve indirect identification of the product via chemiluminescence, 
radioactive scintigraphy of incorporated radiolabel or fluorescent label or even via a 
system using electrical and/or thermal impulse signals (Affymax technology; Bellus, 
5 1994). 

A number of template dependent processes are available to amplify the 
oligonucleotide sequences present in a given template sample. One of the best known 
amplification methods is the polymerase chain reaction (referred to as PCR™) which is 
described in detail in U.S. Patents 4,683,195, 4,683,202 and 4,800,159, and in Innis et al, 
10 1988, each of which is incorporated herein by reference in their entirety. 

Another method for amplification is ligase chain reaction ("LCR"), disclosed in 
European Application No. 320 308, incorporated herein by reference in its entirety. U.S. 
Patent 4,883,750 describes a method similar to LCR for binding probe pairs to a target 
sequence. A method based on PCR™ and oligonucleotide ligase assay (OLA) (described 
15 in further detail below), disclosed in U.S. Patent 5,912,148, may also be used. 

Alternative methods for amplification of target nucleic acid sequences that may be 
used in the practice of the present invention are disclosed in U.S. Patents 5,843,650, 
5,846,709, 5,846,783, 5,849,546, 5,849,497, 5,849,547, 5,858,652, 5,866,366, 5,916,776, 
5,922,574, 5,928,905, 5,928,906, 5,932,451, 5,935,825, 5,939,291 and 5,942,391, Great 
20 Britain Application 2 202 328, and in PCT Application PCT/US89/01025, each of which 
is incorporated herein by reference in its entirety. Qbeta Replicase, described in PCT 
Application PCT/US87/00880, may also be used as an amplification method in the 
present invention. 

An isothermal amplification method, in which restriction endonucleases and 
25 ligases are used to achieve the amplification of target molecules that contain nucleotide 
5 '-[alpha-thio] -triphosphates in one strand of a restriction site may also be useful in the 
amplification of nucleic acids in the present invention (Walker et al, 1992). Strand 
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Displacement Amplification (SDA), disclosed in U.S. Patent 5,916,779, is another 
method of carrying out isothermal amplification of nucleic acids which involves multiple 
rounds of strand displacement and synthesis, i.e., nick translation. 

Other nucleic acid amplification procedures include transcription-based 
5 amplification systems (TAS), including nucleic acid sequence based amplification 
(NASBA) and 3SR (Kwoh et al, 1989; PCT Application WO 88/10315, incorporated 
herein by reference in their entirety). European Application 329 822 disclose a nucleic 
acid amplification process involving cyclically synthesizing single-stranded RNA 
("ssRNA"), ssDNA, and double-stranded DNA (dsDNA), which may be used in 
1 0 accordance with the present invention. 

PCT Application WO 89/06700 (incorporated herein by reference in its entirety) 
disclose a nucleic acid sequence amplification scheme based on the hybridization of a 
promoter region/primer sequence to a target single-stranded DNA ("ssDNA") followed by 
transcription of many RNA copies of the sequence. This scheme is not cyclic, i.e., new 
15 templates are not produced from the resultant RNA transcripts. Other amplification 
methods include "RACE" and "one-sided PCR" (Frohman, 1990; Ohara et al, 1989). 

C. Detection of Nucleic Acids 

Following any amplification, it may be desirable to separate the amplification 
product from the template and/or the excess primer. In one embodiment, amplification 
20 products are separated by agarose, agarose-acrylamide or polyacrylamide gel 
electrophoresis using standard methods (Sambrook et al, 2001). Separated amplification 
products may be cut out and eluted from the gel for further manipulation. Using low 
melting point agarose gels, the separated band may be removed by heating the gel, 
followed by extraction of the nucleic acid. 

25 Separation of nucleic acids may also be effected by spin columns and/or 

chromatographic techniques known in art. There are many kinds of chromatography 
which may be used in the practice of the present invention, including adsorption, 
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partition, ion-exchange, hydroxylapatite, molecular sieve, reverse-phase, column, paper, 
thin-layer, and gas chromatography as well as HPLC. 

In certain embodiments, the amplification products are visualized, with or without 
separation. A typical visualization method involves staining of a gel with ethidium 
5 bromide and visualization of bands under UV light. Alternatively, if the amplification 
products are integrally labeled with radio- or fluorometrically-labeled nucleotides, the 
separated amplification products can be exposed to x-ray film or visualized under the 
appropriate excitatory spectra. 

In one embodiment, following separation of amplification products, a labeled 
10 nucleic acid probe is brought into contact with the amplified marker sequence. The probe 
preferably is conjugated to a chromophore but may be radiolabeled. In another 
embodiment, the probe is conjugated to a binding partner, such as an antibody or biotin, 
or another binding partner carrying a detectable moiety. 

In particular embodiments, detection is by Southern blotting and hybridization 
1 5 with a labeled probe. The techniques involved in Southern blotting are well known to 
those of skill in the art (see Sambrook et al. , 2001). One example of the foregoing is 
described in U.S. Patent 5,279,721, incorporated by reference herein, which discloses an 
apparatus and method for the automated electrophoresis and transfer of nucleic acids. 
The apparatus permits electrophoresis and blotting without external manipulation of the 
20 gel and is ideally suited to carrying out methods according to the present invention. 

Other methods of nucleic acid detection that may be used in the practice of the 
instant invention are disclosed in U.S. Patents 5,840,873, 5,843,640, 5,843,651, 
5,846,708, 5,846,717, 5,846,726, 5,846,729, 5,849,487, 5,853,990, 5,853,992, 5,853,993, 
5,856,092, 5,861,244, 5,863,732, 5,863,753, 5,866,331, 5,905,024, 5,910,407, 5,912,124, 
25 5,912,145, 5,919,630, 5,925,517, 5,928,862, 5,928,869, 5,929,227, 5,932,413 and 
5,935,791, each of which is incorporated herein by reference. 

D. Other Assays 
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Other methods for genetic screening may be used within the scope of the present 
invention, for example, to detect mutations in genomic DNA, cDNA and/or RNA 
samples. Methods used to detect point mutations include denaturing gradient gel 
electrophoresis ("DGGE"), restriction fragment length polymorphism analysis ("RFLP"), 
5 chemical or enzymatic cleavage methods, direct sequencing of target regions amplified by 
PCR™ (see above), single-strand conformation polymorphism analysis ("SSCP") and 
other methods well known in the art. 

One method of screening for point mutations is based on RNase cleavage of base 
pair mismatches in RNA/DNA or RNA/RNA heteroduplexes. As used herein, the term 
10 "mismatch" is defined as a region of one or more unpaired or mispaired nucleotides in a 
double-stranded RNA/RNA, RNA/DNA or DNA/DNA molecule. This definition thus 
includes mismatches due to insertion/deletion mutations, as well as single or multiple 
base point mutations. 

U.S. Patent 4,946,773 describes an RNase A mismatch cleavage assay that 
15 involves annealing single-stranded DNA or RNA test samples to an RNA probe, and 
subsequent treatment of the nucleic acid duplexes with RNase A. For the detection of 
mismatches, the single-stranded products of the RNase A treatment, electrophoretically 
separated according to size, are compared to similarly treated control duplexes. Samples 
containing smaller fragments (cleavage products) not seen in the control duplex are 
20 scored as positive. 

Other investigators have described the use of RNase I in mismatch assays. The use 
of RNase I for mismatch detection is described in literature from Promega Biotech. 
Promega markets a kit containing RNase I that is reported to cleave three out of four 
known mismatches. Others have described using the MutS protein or other DNA-repair 
25 enzymes for detection of single-base mismatches. 

Alternative methods for detection of deletion, insertion or substitution mutations 
that may be used in the practice of the present invention are disclosed in U.S. Patents 
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5,849,483, 5,851,770, 5,866,337, 5,925,525 and 5,928,870, each of which is incorporated 
herein by reference in its entirety. 

E. Specific Examples of SNP Screening Methods 

Spontaneous mutations that arise during the course of evolution in the genomes of 
5 organisms are often not immediately transmitted throughout all of the members of the 
species, thereby creating polymorphic alleles that co-exist in the species populations. 
Often polymorphisms are the cause of genetic diseases. Several classes of 
polymorphisms have been identified. For example, variable nucleotide type 
polymorphisms (VNTRs), arise from spontaneous tandem duplications of di- or 
10 trinucleotide repeated motifs of nucleotides. If such variations alter the lengths of DNA 
fragments generated by restriction endonuclease cleavage, the variations are referred to as 
restriction fragment length polymorphisms (RFLPs). RFLPs are been widely used in 
human and animal genetic analyses. 

Another class of polymorphisms are generated by the replacement of a single 
nucleotide. Such single nucleotide polymorphisms (SNPs) rarely result in changes in a 
restriction endonuclease site. Thus, SNPs are rarely detectable restriction fragment length 
analysis. SNPs are the most common genetic variations and occur once every 100 to 300 
bases and several SNP mutations have been found that affect a single nucleotide in a 
protein-encoding gene in a manner sufficient to actually cause a genetic disease. SNP 
diseases are exemplified by hemophilia, sickle-cell anemia, hereditary hemochromatosis, 
late-onset alzheimer disease etc. 

In context of the present invention, polymorphic mutations that affect the activity 
and/or levels of the UGT1A1 gene products, which are responsible for the glucuronidation 
of irinotecan and other chemotherapeutic and xenobiotic agents, will be determined by a 
25 series of screening methods. One set of screening methods is aimed at identifying SNPs 
that affect the inducibility, activity and/or level of the UGT1A1 gene products in in vitro 
or in vivo assays. The other set of screening methods will then be performed to screen an 
individual for the occurrence of the SNPs identified above. To do this, a sample (such as 
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20 



blood or other bodily fluid or tissue sample) will be taken from a patient for genotype 
analysis. The presence or absence of SNPs will determine the ability of the screened 
individuals to metabolize irinotecan and other chemotherapeutic agents that are 
metabolized by the UGT1A1 gene products. According to methods provided by the 
5 invention, these results will be used to adjust and/or alter the dose of irinotecan or other 
agent administered to an individual in order to reduce drug side effects. 

SNPs can be the result of deletions, point mutations and insertions and in general 
any single base alteration, whatever the cause, can result in a SNP. The greater frequency 
of SNPs means that they can be more readily identified than the other classes of 

10 polymorphisms. The greater uniformity of their distribution permits the identification of 
SNPs "nearer" to a particular trait of interest. The combined effect of these two attributes 
makes SNPs extremely valuable. For example, if a particular trait {e.g., inability to 
efficiently metabolize irinotecan) reflects a mutation at a particular locus, then any 
polymorphism that is linked to the particular locus can be used to predict the probability 

1 5 that an individual will be exhibit that trait. 

Several methods have been developed to screen polymorphisms and some 
examples are listed below. The reference of Kwok and Chen (2003) and Kwok (2001) 
provide overviews of some of these methods; both of these references are specifically 
incorporated by reference. 

20 SNPs relating to glucuronidation of chemotherapeutic agents can be characterized 

by the use of any of these methods or suitable modification thereof. Such methods 
include the direct or indirect sequencing of the site, the use of restriction enzymes where 
the respective alleles of the site create or destroy a restriction site, the use of allele- 
specific hybridization probes, the use of antibodies that are specific for the proteins 

25 encoded by the different alleles of the polymorphism, or any other biochemical 
interpretation. 

i) DNA Sequencing 
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The most commonly used method of characterizing a polymorphism is direct 
DNA sequencing of the genetic locus that flanks and includes the polymorphism. Such 
analysis can be accomplished using either the "dideoxy-mediated chain termination 
method," also known as the "Sanger Method" (Sanger, F., et al, 1975) or the "chemical 
5 degradation method," also known as the "Maxam-Gilbert method" (Maxam, A. M., et al, 
1977). Sequencing in combination with genomic sequence-specific amplification 
technologies, such as the polymerase chain reaction may be utilized to facilitate the 
recovery of the desired genes (Mullis, K. et al, 1986; European Patent Application 
50,424; European Patent Application. 84,796, European Patent Application 258,017, 
10 European Patent Application. 237,362; European Patent Application. 201,184; U.S. 
Patents 4,683,202; 4,582,788; and 4,683,194), all of the above incorporated herein by 
reference. 

ii) Exonuclease Resistance 

Other methods that can be employed to determine the identity of a nucleotide 
15 present at a polymorphic site utilize a specialized exonuclease-resistant nucleotide 
derivative (U.S. Patent. 4,656,127). A primer complementary to an allelic sequence 
immediately 3 '-to the polymorphic site is hybridized to the DNA under investigation. If 
the polymorphic site on the DNA contains a nucleotide that is complementary to the 
particular exonucleotide-resistant nucleotide derivative present, then that derivative will 
20 be incorporated by a polymerase onto the end of the hybridized primer. Such 
incorporation makes the primer resistant to exonuclease cleavage and thereby permits its 
detection. As the identity of the exonucleotide-resistant derivative is known one can 
determine the specific nucleotide present in the polymorphic site of the DNA. 

iii) Microsequencing Methods 

25 Several other primer-guided nucleotide incorporation procedures for assaying 

polymorphic sites in DNA have been described (Komher, J. S. et al, 1989; Sokolov, B. 
P., 1990; Syvanen 1990; Kuppuswamy et al, 1991; Prezant et al, 1992; Ugozzoll, L. et 
al, 1992; Nyren et al, 1993). These methods rely on the incorporation of labeled 
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deoxynucleotides to discriminate between bases at a polymorphic site. As the signal is 
proportional to the number of deoxynucleotides incorporated, polymorphisms that occur 
in runs of the same nucleotide result in a signal that is proportional to the length of the 
run (Syvanen et al, 1990). 

5 iv) Extension in Solution 

French Patent 2,650,840 and PCT Application W09 1/02087 discuss a solution- 
based method for determining the identity of the nucleotide of a polymorphic site. 
According to these methods, a primer complementary to allelic sequences immediately 3'- 
to a polymorphic site is used. The identity of the nucleotide of that site is determined 
10 using labeled dideoxynucleotide derivatives which are incorporated at the end of the 
primer if complementary to the nucleotide of the polymorphic site. 

v) Genetic Bit Analysis or Solid-Phase Extension 

PCT Application W092/15712 describes a method that uses mixtures of labeled 
terminators and a primer that is complementary to the sequence 3' to a polymorphic site. 
15 The labeled terminator that is incorporated is complementary to the nucleotide present in 
the polymorphic site of the target molecule being evaluated and is thus identified. Here 
the primer or the target molecule is immobilized to a solid phase. 

vi) Oligonucleotide Ligation Assay (OLA) 

This is another solid phase method that uses different methodology (Landegren et 
20 al. , 1 988). Two oligonucleotides, capable of hybridizing to abutting sequences of a single 
strand of a target DNA are used. One of these oligonucleotides is biotinylated while the 
other is detectably labeled. If the precise complementary sequence is found in a target 
molecule, the oligonucleotides will hybridize such that their termini abut, and create a 
ligation substrate. Ligation permits the recovery of the labeled oligonucleotide by using 
25 avidin. Other nucleic acid detection assays, based on this method, combined with PCR 
have also been described (Nickerson et al, 1990). Here PCR is used to achieve the 
exponential amplification of target DNA, which is then detected using the OLA. 
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vii) Ligase/Polymerase-Mediated Genetic Bit Analysis 



U.S. Patent 5,952,174 describes a method that also involves two primers capable 
of hybridizing to abutting sequences of a target molecule. The hybridized product is 
formed on a solid support to which the target is immobilized. Here the hybridization 
5 occurs such that the primers are separated from one another by a space of a single 
nucleotide. Incubating this hybridized product in the presence of a polymerase, a ligase, 
and a nucleoside triphosphate mixture containing at least one deoxynucleoside 
triphosphate allows the ligation of any pair of abutting hybridized oligonucleotides. 
Addition of a ligase results in two events required to generate a signal, extension and 
10 ligation. This provides a higher specificity and lower "noise" than methods using either 
extension or ligation alone and unlike the polymerase-based assays, this method enhances 
the specificity of the polymerase step by combining it with a second hybridization and a 
ligation step for a signal to be attached to the solid phase. 

viii) Other Methods To Detect SNPs 

1 5 Several other specific methods for SNP detection and identification are presented 

below and may be used as such or with suitable modifications in conjunction with 
identifying polymorphisms of the UGT1A1 genes in the present invention. Several other 
methods are also described on the SNP web site of the NCBI at the website 
www.ncbi.nlm.nih.gov/SNP, incorporated herein by reference. 

20 In a particular embodiment, extended haplotypes may be determined at any given 

locus in a population, which allows one to identify exactly which SNPs will be redundant 
and which will be essential in association studies. The latter is referred to as 'haplotype 
tag SNPs (htSNPs)', markers that capture the haplotypes of a gene or a region of linkage 
disequilibrium. See Johnson et al. (2001) and Ke and Cardon (2003), each of which is 

25 incorporated herein by reference, for exemplary methods. 

The VDA-assay utilizes PCR amplification of genomic segments by long PCR 
methods using TaKaRa LA Taq reagents and other standard reaction conditions. The 
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long amplification can amplify DNA sizes of about 2,000-12,000 bp. Hybridization of 
products to variant detector array (VDA) can be performed by a Affymetrix High 
Throughput Screening Center and analyzed with computerized software. 

A method called Chip Assay uses PCR amplification of genomic segments by 
5 standard or long PCR protocols. Hybridization products are analyzed by VDA, Halushka 
et al, 1999, incorporated herein by reference. SNPs are generally classified as "Certain" 
or "Likely" based on computer analysis of hybridization patterns. By comparison to 
alternative detection methods such as nucleotide sequencing, "Certain" SNPs have been 
confirmed 100% of the time; and "Likely" SNPs have been confirmed 73% of the time by 
10 this method. 

Other methods simply involve PCR amplification following digestion with the 
relevant restriction enzyme. Yet others involve sequencing of purified PCR products 
from known genomic regions. 

In yet another method, individual exons or overlapping fragments of large exons 
15 are PCR-amplified. Primers are designed from published or database sequences and 

PCR-amplification of genomic DNA is performed using the following conditions: 200 ng 

DNA template, 0.5uM each primer, 80uM each of dCTP, dATP, dTTP and dGTP, 5% 

formamide, 1.5mM MgCl 2 , 0.5U of Taq polymerase and 0.1 volume of the Taq buffer. 

Thermal cycling is performed and resulting PCR-products are analyzed by PCR-single 
20 strand conformation polymorphism (PCR-SSCP) analysis, under a variety of conditions, 

e.g, 5 or 10% polyacrylamide gel with 15% urea, with or without 5% glycerol. 

Electrophoresis is performed overnight. PCR-products that show mobility shifts are 

reamplified and sequenced to identify nucleotide variation. 

In a method called CGAP-GAI (DEMIGLACE), sequence and alignment data 
25 (from a PHRAP.ace file), quality scores for the sequence base calls (from PHRED quality 
files), distance information (from PHYLIP dnadist and neighbour programs) and base- 
calling data (from PHRED '-d' switch) are loaded into memory. Sequences are aligned 
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and examined for each vertical chunk ('slice') of the resulting assembly for disagreement. 
Any such slice is considered a candidate SNP (DEMIGLACE). A number of filters are 
used by DEMIGLACE to eliminate slices that are not likely to represent true 
polymorphisms. These include filters that: (i) exclude sequences in any given slice from 
5 SNP consideration where neighboring sequence quality scores drop 40% or more; (ii) 
exclude calls in which peak amplitude is below the fifteenth percentile of all base calls 
for that nucleotide type; (iii) disqualify regions of a sequence having a high number of 
disagreements with the consensus from participating in SNP calculations; (iv) removed 
from consideration any base call with an alternative call in which the peak takes up 25% 
10 or more of the area of the called peak; (v) exclude variations that occur in only one read 
direction. PHRED quality scores were converted into probability-of-error values for each 
nucleotide in the slice. Standard Baysian methods are used to calculate the posterior 
probability that there is evidence of nucleotide heterogeneity at a given location. 

In a method called CU-RDF (RESEQ), PCR amplification is performed from 
1 5 DNA isolated from blood using specific primers for each SNP, and after typical cleanup 
protocols to remove unused primers and free nucleotides, direct sequencing using the 
same or nested primers. 

In a method called DEBNICK (METHOD-B), a comparative analysis of clustered 
EST sequences is performed and confirmed by fluorescent-based DNA sequencing. In a 
20 related method, called DEBNICK (METHOD-C), comparative analysis of clustered EST 
sequences with phred quality > 20 at the site of the mismatch, average phred quality >= 
20 over 5 bases 5'-FLANK and 3' to the SNP, no mismatches in 5 bases 5' and 3' to the 
SNP, at least two occurrences of each allele is performed and confirmed by examining 
traces. 

25 In a method identified by ERO (RESEQ), new primers sets are designed for 

electronically published STSs and used to amplify DNA from 10 different mouse strains. 
The amplification product from each strain is then gel purified and sequenced using a 
standard dideoxy, cycle sequencing technique with 33 P-labeled terminators. All the 
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ddATP terminated reactions are then loaded in adjacent lanes of a sequencing gel 
followed by all of the ddGTP reactions and so on. SNPs are identified by visually 
scanning the radiographs. 

In another method identified as ERO (RESEQ-HT), new primers sets are designed 
5 for electronically published murine DNA sequences and used to amplify DNA from 10 
different mouse strains. The amplification product from each strain is prepared for 
sequencing by treating with Exonuclease I and Shrimp Alkaline Phosphatase. 
Sequencing is performed using ABI Prism Big Dye Terminator Ready Reaction Kit 
(Perkin-Elmer) and sequence samples are run on the 3700 DNA Analyzer (96 Capillary 
10 Sequencer). 

FGU-CBT (SCA2-SNP) identifies a method where the region containing the SNP 
is PCR amplified using the primers SCA2-FP3 and SCA2-RP3. Approximately 100 ng 
of genomic DNA is amplified in a 50 ml reaction volume containing a final concentration 
of 5mM Tris, 25mM KC1, 0.75mM MgCl 2 , 0.05% gelatin, 20pmol of each primer and 
15 0.5U of Taq DNA polymerase. Samples are denatured, annealed and extended and the 
PCR product is purified from a band cut out of the agarose gel using, for example, the 
QIAquick gel extraction kit (Qiagen) and is sequenced using dye terminator chemistry on 
an ABI Prism 377 automated DNA sequencer with the PCR primers. 

In a method identified as JBLACK (SEQ/RESTRICT), two independent PCR 
20 reactions are performed with genomic DNA. Products from the first reaction are 
analyzed by sequencing, indicating a unique Fspl restriction site. The mutation is 
confirmed in the product of the second PCR reaction by digesting with Fsp I. 

In a method described as KWOK(l), SNPs are identified by comparing high 
quality genomic sequence data from four randomly chosen individuals by direct DNA 
25 sequencing of PCR products with dye-terminator chemistry (see Kwok et al, 1996). In a 
related method identified as KWOK (2) SNPs) are identified by comparing high quality 
genomic sequence data from overlapping large-insert clones such as bacterial artificial 

55123561.1 

-40- 



chromosomes (BACs) or PI -based artificial chromosomes (PACs). An STS containing 
this SNP is then developed and the existence of the SNP in various populations is 
confirmed by pooled DNA sequencing (see Taillon-Miller et ah, 1998). In another 
similar method called KWOK(3), SNPs are identified by comparing high quality genomic 
5 sequence data from overlapping large-insert clones BACs or PACs. The SNPs found by 
this approach represent DNA sequence variations between the two donor chromosomes 
but the allele frequencies in the general population have not yet been determined. In 
method KWOK(5), SNPs are identified by comparing high quality genomic sequence 
data from a homozygous DNA sample and one or more pooled DNA samples by direct 
10 DNA sequencing of PCR products with dye-terminator chemistry. The STSs used are 
developed from sequence data found in publicly available databases. Specifically, these 
STSs are amplified by PCR against a complete hydatidiform mole (CHM) that has been 
shown to be homozygous at all loci and a pool of DNA samples from 80 CEPH parents 
(seeKwok et ah, 1994). 

1 5 In another such method, KWOK (OverlapSnpDetectionWithPolyBayes), SNPs are 

discovered by automated computer analysis of overlapping regions of large-insert human 
genomic clone sequences. For data acquisition, clone sequences are obtained directly 
from large-scale sequencing centers. This is necessary because base quality sequences are 
not present/available through GenBank. Raw data processing involves analyzed of clone 

20 sequences and accompanying base quality information for consistency. Finished ('base 
perfect', error rate lower than 1 in 10,000 bp) sequences with no associated base quality 
sequences are assigned a uniform base quality value of 40 (1 in 10,000 bp error rate). 
Draft sequences without base quality values are rejected. Processed sequences are 
entered into a local database. A version of each sequence with known human repeats 

25 masked is also stored. Repeat masking is performed with the program "MASKERAID." 
Overlap detection: Putative overlaps are detected with the program "WUBLAST." 
Several filtering steps followed in order to eliminate false overlap detection results, i.e. 
similarities between a pair of clone sequences that arise due to sequence duplication as 
opposed to true overlap. Total length of overlap, overall percent similarity, number of 
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sequence differences between nucleotides with high base quality value "high-quality 
mismatches." Results are also compared to results of restriction fragment mapping of 
genomic clones at Washington University Genome Sequencing Center, finisher's reports 
on overlaps, and results of the sequence contig building effort at the NCBI. SNP 
detection: Overlapping pairs of clone sequence are analyzed for candidate SNP sites with 
the 'POLYBAYES' SNP detection software. Sequence differences between the pair of 
sequences are scored for the probability of representing true sequence variation as 
opposed to sequencing error. This process requires the presence of base quality values 
for both sequences. High-scoring candidates are extracted. The search is restricted to 
substitution-type single base pair variations. Confidence score of candidate SNP is 
computed by the POLYBAYES software. 

In method identified by KWOK (TaqMan assay), the TaqMan assay is used to 
determine genotypes for 90 random individuals. In method identified by KYUGEN(Ql), 
DNA samples of indicated populations are pooled and analyzed by PLACE-SSCP. Peak 
heights of each allele in the pooled analysis are corrected by those in a heterozygote, and 
are subsequently used for calculation of allele frequencies. Allele frequencies higher than 
10% are reliably quantified by this method. Allele frequency = 0 (zero) means that the 
allele was found among individuals, but the corresponding peak is not seen in the 
examination of pool. Allele frequency = 0-0.1 indicates that minor alleles are detected in 
the pool but the peaks are too low to reliably quantify. 

In yet another method identified as KYUGEN (Methodl), PCR products are post- 
labeled with fluorescent dyes and analyzed by an automated capillary electrophoresis 
system under SSCP conditions (PLACE-SSCP). Four or more individual DNAs are 
analyzed with or without two pooled DNA (Japanese pool and CEPH parents pool) in a 
series of experiments. Alleles are identified by visual inspection. Individual DNAs with 
different genotypes are sequenced and SNPs identified. Allele frequencies are estimated 
from peak heights in the pooled samples after correction of signal bias using peak heights 
in heterozygotes. For the PCR primers are tagged to have 5'-ATT or 5'-GTT at their ends 
for post-labeling of both strands. Samples of DNA (10 ng/ul) are amplified in reaction 
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mixtures containing the buffer (lOmM Tris-HCl, pH 8.3 or 9.3, 50mM KC1, 2.0mM 
MgCl 2 ), 0.25uM of each primer, 200uM of each dNTP, and 0.025 units/ul of Taq DNA 
polymerase premixed with anti-Taq antibody. The two strands of PCR products are 
differentially labeled with nucleotides modified with R110 and R6G by an exchange 
5 reaction of Klenow fragment of DNA polymerase I. The reaction is stopped by adding 
EDTA, and unincorporated nucleotides are dephosphorylated by adding calf intestinal 
alkaline phosphatase. For the SSCP: an aliquot of fluorescently labeled PCR products 
and TAMRA-labeled internal markers are added to deionized formamide, and denatured. 
Electrophoresis is performed in a capillary using an ABI Prism 310 Genetic Analyzer. 

10 Genescan softwares (P-E Biosystems) are used for data collection and data processing. 
DNA of individuals (two to eleven) including those who showed different genotypes on 
SSCP are subjected for direct sequencing using big-dye terminator chemistry, on ABI 
Prism 310 sequencers. Multiple sequence trace files obtained from ABI Prism 310 are 
processed and aligned by Phred/Phrap and viewed using Consed viewer. SNPs are 

1 5 identified by PolyPhred software and visual inspection. 

In yet another method identified as KYUGEN (Method2), individuals with 
different genotypes are searched by denaturing HPLC (DHPLC) or PLACE-SSCP 
(Inazuka et al, 1997) and their sequences are determined to identify SNPs. PCR is 
performed with primers tagged with 5'- ATT or 5'-GTT at their ends for post-labeling of 

20 both strands. DHPLC analysis is carried out using the WAVE DNA fragment analysis 
system (Transgenomic). PCR products are injected into DNASep column, and separated 
under the conditions determined using WAVEMaker program (Transgenomic). The two 
strands of PCR products that are differentially labeled with nucleotides modified with 
Rl 10 and R6G by an exchange reaction of Klenow fragment of DNA polymerase I. The 

25 reaction is stopped by adding EDTA, and unincorporated nucleotides are 
dephosphorylated by adding calf intestinal alkaline phosphatase. SSCP followed by 
electrophoresis is performed in a capillary using an ABI Prism 310 Genetic Analyzer. 
Genescan softwares (P-E Biosystems). DNA of individuals including those who showed 
different genotypes on DHPLC or SSCP are subjected for direct sequencing using big-dye 
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terminator chemistry, on ABI Prism 310 sequencer. Multiple sequence trace files 
obtained from ABI Prism 310 are processed and aligned by Phred/Phrap and viewed 
using Consed viewer. SNPs are identified by PolyPhred software and visual inspection. 
Trace chromatogram data of EST sequences in Unigene are processed with PHRED. To 

5 identify likely SNPs, single base mismatches are reported from multiple sequence 
alignments produced by the programs PHRAP, BRO and POA for each Unigene cluster. 
BRO corrected possible misreported EST orientations, while POA identified and 
analyzed non-linear alignment structures indicative of gene mixing/chimeras that might 
produce spurious SNPs. Bayesian inference is used to weigh evidence for true 

10 polymorphism versus sequencing error, misalignment or ambiguity, misclustering or 
chimeric EST sequences, assessing data such as raw chromatogram height, sharpness, 
overlap and spacing; sequencing error rates; context-sensitivity; cDNA library origin, etc. 

In method identified as MARSHFIELD(Method-B), overlapping human DNA 
sequences which contained putative insertion/deletion polymorphisms are identified 
15 through searches of public databases. PCR primers which flanked each polymorphic site 
are selected from the consensus sequences. Primers are used to amplify individual or 
pooled human genomic DNA. Resulting PCR products are resolved on a denaturing 
polyacrylamide gel and a Phosphorlmager is used to estimate allele frequencies from 
DNA pools. 

20 IV. PHARMACEUTICAL COMPOSITIONS 

Aqueous compositions may have an effective amount of irinotecan and/or an 
effective amount of a compound (second agent) that increases conjugative enzyme 
activity, as represented by a compound that increases the activity of the phase II 
conjugative enzyme, glucuronosyltransferase or that decreases biliary transport. Such 
25 compositions will generally be dissolved or dispersed in a pharmaceutical^ acceptable 
carrier or aqueous medium. 

The phrases "pharmaceutically or pharmacologically acceptable" refer to 
molecular entities and compositions that do not produce an adverse, allergic or other 
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untoward reaction when administered to an animal, or human, as appropriate. As used 
herein, "pharmaceutically acceptable carrier" includes any and all solvents, dispersion 
media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying 
agents and the like. The use of such media and agents for pharmaceutical active 
substances is well known in the art. Except insofar as any conventional media or agent is 
incompatible with the active ingredients, its use in the therapeutic compositions is 
contemplated. Supplementary active ingredients, such as other anti-cancer agents, can 
also be incorporated into the compositions. 

In addition to the compounds formulated for parenteral administration, such as 
intravenous or intramuscular injection, other pharmaceutically acceptable forms include, 
e.g., tablets or other solids for oral administration; time release capsules; and any other 
form currently used, including cremes, lotions, mouthwashes, inhalants and the like. 

A. Parenteral Administration 

The active compounds will often be formulated for parenteral administration, e.g., 
formulated for injection via the intravenous, intramuscular, sub-cutaneous, or even 
intraperitoneal routes. The preparation of an aqueous composition that contains 
irinotecan and a second agent as active ingredients will be known to those of skill in the 
art in light of the present disclosure. Typically, such compositions can be prepared as 
injectables, either as liquid solutions or suspensions; solid forms suitable for using to 
prepare solutions or suspensions upon the addition of a liquid prior to injection can also 
be prepared; and the preparations can also be emulsified. 

Solutions of the active compounds as free base or pharmacologically acceptable 
salts can be prepared in water suitably mixed with a surfactant, such as 
hydroxypropylcellulose. Dispersions can also be prepared in glycerol, liquid polyethylene 
glycols, and mixtures thereof and in oils. Under ordinary conditions of storage and use, 
these preparations contain a preservative to prevent the growth of microorganisms. 

The pharmaceutical forms suitable for injectable use include sterile aqueous 
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solutions or dispersions; formulations including sesame oil, peanut oil or aqueous 
propylene glycol; and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersions. In all cases the form must be sterile and must be fluid 
to the extent that easy syringability exists. It must be stable under the conditions of 
5 manufacture and storage and must be preserved against the contaminating action of 
microorganisms, such as bacteria and fungi. 

The active compounds may be formulated into a composition in a neutral or salt 
form. Pharmaceutically acceptable salts, include the acid addition salts (formed with the 
free amino groups of the protein) and which are formed with inorganic acids such as, for 
10 example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, 
calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 
histidine, procaine and the like. 

1 5 The carrier can also be a solvent or dispersion medium containing, for example, 

water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene 
glycol, and the like), suitable mixtures thereof, and vegetable oils. The proper fluidity 
can be maintained, for example, by the use of a coating, such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 

20 surfactants. The prevention of the action of microorganisms can be brought about by 
various antibacterial ad antifungal agents, for example, parabens, chlorobutanol, phenol, 
sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the 
injectable compositions can be brought about by the use in the compositions of agents 

25 delaying absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the active compounds in 
the required amount in the appropriate solvent with various of the other ingredients 
enumerated above, as required, followed by filtered sterilization. Generally, dispersions 
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are prepared by incorporating the various sterilized active ingredients into a sterile vehicle 
which contains the basic dispersion medium and the required other ingredients from those 
enumerated above. In the case of sterile powders for the preparation of sterile injectable 
solutions, the preferred methods of preparation are vacuum-drying and freeze-drying 
5 techniques which yield a powder of the active ingredient plus any additional desired 
ingredient from a previously sterile-filtered solution thereof. 

Upon formulation, solutions will be administered in a manner compatible with the 
dosage formulation and in such amount as is therapeutically effective. The formulations 
are easily administered in a variety of dosage forms, such as the type of injectable 
10 solutions described above, with even drug release capsules and the like being employable. 

For parenteral administration in an aqueous solution, for example, the solution 
should be suitably buffered if necessary and the liquid diluent first rendered isotonic with 
sufficient saline or glucose. These particular aqueous solutions are especially suitable for 
intravenous, intramuscular, subcutaneous and intraperitoneal administration. In this 

1 5 connection, sterile aqueous media which can be employed will be known to those of skill 
in the art in light of the present disclosure. For example, one dosage could be dissolved 
in lmL of isotonic NaCl solution and either added to lOOOmL of hypodermoclysis fluid 
or injected at the proposed site of infusion, (see for example, "Remington's 
Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570-1580). Some 

20 variation in dosage will necessarily occur depending on the condition of the subject being 
treated. The person responsible for administration will, in any event, determine the 
appropriate dose for the individual subject. 

B. Oral Administration 

In certain embodiments, active compounds may be administered orally. This is 
25 contemplated for agents which are generally resistant, or have been rendered resistant, to 
proteolysis by digestive enzymes. Such compounds are contemplated to include all those 
compounds, or drugs, that are available in tablet form from the manufacturer and 
derivatives and analogues thereof. 
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For oral administration, the active compounds may be administered, for example, 
with an inert diluent or with an assimilable edible carrier, or they may be enclosed in hard 
or soft shell gelatin capsule, or compressed into tablets, or incorporated directly with the 
food of the diet. For oral therapeutic administration, the active compounds may be 

5 incorporated with excipients and used in the form of ingestible tablets, buccal tables, 
troches, capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions 
and preparations should contain at least 0.1% of active compound. The percentage of the 
compositions and preparations may, of course, be varied and may conveniently be 
between about 2 to about 60% of the weight of the unit. The amount of active 

10 compounds in such therapeutically useful compositions is such that a suitable dosage will 
be obtained. 

The tablets, troches, pills, capsules and the like may also contain the following: a 
binder, as gum tragacanth, acacia, cornstarch, or gelatin; excipients, such as dicalcium 
phosphate; a disintegrating agent, such as corn starch, potato starch, alginic acid and the 

15 like; a lubricant, such as magnesium stearate; and a sweetening agent, such as sucrose, 
lactose or saccharin may be added or a flavoring agent, such as peppermint, oil of 
wintergreen, or cherry flavoring. When the dosage unit form is a capsule, it may contain, 
in addition to materials of the above type, a liquid carrier. Various other materials may be 
present as coatings or to otherwise modify the physical form of the dosage unit. For 

20 instance, tablets, pills, or capsules may be coated with shellac, sugar or both. A syrup of 
elixir may contain the active compounds sucrose as a sweetening agent methyl and 
propylparabens as preservatives, a dye and flavoring, such as cherry or orange flavor. Of 
course, any material used in preparing any dosage unit form should be pharmaceutically 
pure and substantially non-toxic in the amounts employed. In addition, the active 

25 compounds may be incorporated into sustained-release preparation and formulations. 

Upon formulation, the compounds will be administered in a manner compatible 
with the dosage formulation and in such amount as is therapeutically effective. The 
formulations are easily administered in a variety of dosage forms, such as those described 
below in specific examples. 
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C. Liposomes 



In a particular embodiment, liposomal formulations are contemplated. Liposomal 
encapsulation of pharmaceutical agents prolongs their half-lives when compared to 
conventional drug delivery systems. Because larger quantities can be protectively 
5 packaged, this allow the opportunity for dose-intensity of agents so delivered to cells. 
This would be particularly attractive in the chemotherapy of cervical cancer if there were 
mechanisms to specifically enhance the cellular targeting of such liposomes to these cells. 

"Liposome" is a generic term encompassing a variety of single and multilamellar 
lipid vehicles formed by the generation of enclosed lipid bilayers. Phospholipids are used 

1 0 for preparing the liposomes according to the present invention and can carry a net positive 
charge, a net negative charge or are neutral. Dicetyl phosphate can be employed to confer 
a negative charge on the liposomes, and stearylamine can be used to confer a positive 
charge on the liposomes. Liposomes are characterized by a phospholipid bilayer 
membrane and an inner aqueous medium. Multilamellar liposomes have multiple lipid 

1 5 layers separated by aqueous medium. They form spontaneously when phospholipids are 
suspended in an excess of aqueous solution. The lipid components undergo self- 
rearrangement before the formation of closed structures and entrap water and dissolved 
solutes between the lipid bilayers (Ghosh and Bachhawat, 1991). Also contemplated are 
cationic lipid-nucleic acid complexes, such as lipofectamine-nucleic acid complexes 

20 V. KITS 

Any of the compositions described herein may be comprised in a kit. In a non- 
limiting example, reagents for determining the genotype of one or both UGT1A1 genes 
are included in a kit. The kit may further include individual nucleic acids that can be 
amplify and/or detect particular nucleic acid sequences the UGT1A1 gene. It may also 
25 include one or more buffers, such as a DNA isolation bufffers, an amplification buffer or 
a hybridization buffer. The kit may also contain compounds and reagents to prepare 
DNA templates and isolate DNA from a sample. The kit may also include various 
labeling reagents and compounds. 
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The components of the kits may be packaged either in aqueous media or in 
lyophilized form. The container means of the kits will generally include at least one vial, 
test tube, flask, bottle, syringe or other container means, into which a component may be 
placed, and preferably, suitably aliquoted. Where there are more than one component in 

5 the kit (labeling reagent and label may be packaged together), the kit also will generally 
contain a second, third or other additional container into which the additional components 
may be separately placed. However, various combinations of components may be 
comprised in a vial. The kits of the present invention also will typically include a means 
for containing the nucleic acids, and any other reagent containers in close confinement for 

10 commercial sale. Such containers may include injection or blow-molded plastic 
containers into which the desired vials are retained. 

When the components of the kit are provided in one and/or more liquid solutions, 
the liquid solution is an aqueous solution, with a sterile aqueous solution being 
particularly preferred. However, the components of the kit may be provided as dried 
15 powder(s). When reagents and/or components are provided as a dry powder, the powder 
can be reconstituted by the addition of a suitable solvent. It is envisioned that the solvent 
may also be provided in another container means. 

A kit will also include instructions for employing the kit components as well the 
use of any other reagent not included in the kit. Instructions may include variations that 
20 can be implemented. 

It is contemplated that such reagents are embodiments of kits of the invention. 
Such kits, however, are not limited to the particular items identified above and may 
include any reagent used directly or indirectly in the detection of polymorphisms in the 
UGT1A1 gene or the activity level of the UGT1 Al polypeptide. 

25 

EXAMPLES 
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The following examples are included to demonstrate preferred embodiments of 
the invention. It should be appreciated by those of skill in the art that the techniques 
disclosed in the examples which follow represent techniques discovered by the inventor 
to function well in the practice of the invention, and thus can be considered to constitute 
preferred modes for its practice. However, those of skill in the art should, in light of the 
present disclosure, appreciate that many changes can be made in the specific 
embodiments which are disclosed and still obtain a like or similar result without 
departing from the spirit and scope of the invention. 

EXAMPLE 1 

MATERIALS AND METHODS FOR EXAMPLES 2-5 
Chemicals and reagents 

Exonuclease I and shrimp alkaline phosphatase (exo/SAP) was purchased from 
USB (Cleveland, Ohio, USA). ABI Big Dye terminator cycle-sequencing kit was 
purchased from Applied Biosystems (Foster City, California, USA). Primers for 
amplification, sequencing of the PBREM, and amplification of the (TA)„ polymorphism 
were obtained from GibcoBRL (Invitrogen Co., Carlsbad, California, USA). SN-38 was 
kindly provided by Dr Kiyoshi Terada (Yakult Honsha Co., Ltd, Japan). Camptothecin, 
UDPGA, magnesium chloride, trizma base, potassium monohydrogen phosphate and 1- 
heptanesulfonic acid were purchased from Sigma-Aldrich (St. Louis, Missouri, USA). 
Acetonitrile, tetrahydrofuran and hydrochloric acid were obtained from Fisher Scientific 
(Hanover, Illinois, USA). 

Human livers 

Normal human livers (n = 83) were mainly obtained from Liver Tissue 
Procurement and Distribution System (National Institutes of Diabetes and Digestive and 
Kidney Diseases, Minneapolis, Minnesota). DNA was isolated by using Qiagen 
RNA/DNA Maxi Kit (Qiagen Inc., Valencia, California, USA), and microsomes were 
isolated following differential centrifugation methods (Purba et al, 1987). DNA and 
microsomes were provided by the Liver Core Bank Facility (St. Jude Children's Research 
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Hospital) of the Pharmacogenetics of Anticancer Agents Research (PAAR) Group. In 
order to identify livers in which enzyme degradation occurred, liver samples consistently 
comprised in the 10th percentile of UGT1A1, UGT1A9 and UGT2B7 activities were 
sought. UGT1A9 and UGT2B7 activities were measured using specific probes (data not 
5 shown) (Ramirez et al, 2002 and Innocenti, et al, 2001). Out of eight samples within the 
10th percentile of UGT1A1, only one sample was comprised within the 10th percentile of 
activities of the other two enzyme activities. If different handling/storage of the liver or 
microsomal protein degradation occurred in that sample, this should not have affected the 
degree of phenotype/genotype correlation because the individual had a 7/7 genotype, and 
10 among the 7/7 genotype samples (n = 1 1), it had the 4th lower value. Moreover, lack of 
correlation between UGT1A1 and UGT2B7 activities (n = 83, r = 0.07, P = 0.5) shows that 
differences in tissue integrity and microsome stability have probably a mild influence (if 
any) on the UGT phenotype. 

The ethnic composition of the 83 liver donors comprised: Caucasians 68%, 
15 African- Americans 18%, Asians 1%, others 2%. The percentage of samples of unknown 
ethnic origin was 12%. 

Genotyping of (TA)„ polymorphism 

In order to genotype the (TA) n polymorphism, approximately 40 ng of DNA was 
subjected to amplification by polymerase chain reaction (PCR). The amplification 

20 primers used have been previously described (Monaghan et al, 1996), where the 
sequence of the forward primer is 5'-GTCACGTGACACAGTCAAAC-3' (SEQ ID NO:2) 
and that of the reverse primer is 5'-TTTGCTCCTGCCAGAGGTT-3' (SEQ ID NO:3). 
These primers flank the polymorphic TA locus in the promoter region of the UGT1A1 
gene and amplify a 98bp fragment when a (TA) 6 allele is present and a 100 bp fragment 

25 when a (TA) 7 allele is present. In the presence of (TA) 5 and (TA) 8 alleles, 96 bp and 102 
bp alleles are amplified. The reverse primer is labeled with a fluorescent dye at its 5'-end 
to permit visualization of the amplification product. The amplification reactions were 
performed in a 10ul volume consisting of 1.5 mmol MgCl 2 , 250 mmol dNTPs, 0.8 mmol 
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of each primer and 0.5 U of Taq polymerase (Amplitaq Gold from Applied Biosystems). 
The polymerase was activated at 95°C for 10 min and DNA amplified for 35 cycles at 
95°C for 30 sec, 55°C for 30 sec and 72°C for 30 sec, followed by a final extension at 
72°C for 10 min. Control DNAs from individuals known to have a 6/6, 6/7 and 7/7 
5 genotype were included in the PCR analysis. PCR fragments were subjected to gel 
electrophoresis on an ABI 377 DNA analyzer (Applied Biosystems). Amplified products 
were diluted in a formamide and dextran blue loading buffer and 1 ul combined with 1 ul 
of size standard (GS-350 from Applied Biosystems), denatured at 95°C, and loaded onto 
a 6% denaturing polyacrylamide gel. Electrophoresis was performed for 3.5 hours 
10 following manufacturers recommendations. The Genescan and Genotyper software 
(version 3.7, Applied Biosystems) was used to analyze fragments for size determination. 

Sequencing of PBREM 

A 606 bp region (-3641 to -3036) including PBREM was successfully PCR- 
amplified and sequenced in 81 of the 83 human liver DNAs and 22 of the 24 DNA 

1 5 samples from African- American individuals (Americans of African descent, born in the 
USA) included in the NIGMS HGCR Human VariationPanel (Coriell Institute for 
Medical Research, Camden, New Jersey, USA). The reference sequence shown in Fig. 1 
is that deposited in the GenBank database (accession number AF3 13454). Amplification 
of the PCR product was performed in a 10 or 25-pl reaction volume using the following 

20 primers: 5 '-CTGGGGATA A AC ATGGGATG-3 ' SEQ ID NO:4 (forward) and 5'- 
CACCACCACTTCTGGAACCT-3' SEQ ID NO:5 (reverse). These primers were 
designed using Primer3 software (Rozen et al, 1998). PCR conditions were 2 min at 
94°C, 32 or 33 cycles of a three-step cycling program (30 sec at 94°C, 30 sec at 66.8°C 
and 1 min at 72°C) and 72°C for 3 min. Following exo/SAP cleanup of the PCR product, 

25 this amplicon was then sequenced in the forward and reverse directions using the 
amplification primers, Big Dye terminator chemistry, and run on an ABI 3700 (Applied 
Biosystems) following the manufacturer's protocol. Sequences were analyzed and 
individuals genotyped using the Poly- Phred software (Nickerson et al, 1997). To 
determine the ancestral state of the polymorphisms found in humans, the sequence was 

55123561.1 

-53- 



compared to that of baboon (accession number AC091778). 
SN- 38 glucuronidation assay in human liver microsomes 

Samples were phenotyped by using SN-38 as a substrate for UGT1A1. The 
incubation mixture consisted of 5 umol SN-38, 10 mmol MgCl 2 , 1 mg/ml microsomes, 

5 0.025 mol Tris-HCl (pH 7.4) and 5 mmol UDP-GA. Samples were incubated for 30 min 
at 37°C. The reaction was stopped by the addition of methanol. These conditions were 
selected after previous optimization of the enzyme reaction (Iyer et al, 1998). 
Camptothecin (75 ng) was used as an internal standard. SN-38 glucuronidation was 
measured by HPLC (Hitachi Instruments Inc., San Jose, California, USA) with 

10 fluorescence detection (k excitation = 355 nm, X emission = 515 nm). A uBondapak™ 
Cig column (3.9 X 300 mm, 10 urn; Waters Corp., Milford, Massachusetts, USA) and 
uBondapak™ Ci 8 guardpak (Waters Corp.) were used. A mobile phase of 8/4/88 
acetonitrile/tetra-hydrofuran/0.9 mmol sodium heptanedfonic acid in 50 mmol potassium 
dihydrogen phosphate (pH 4) was used during the first 7 min of the run. From 7.1-25 

1 5 min, the eluent consisted of 30/70 acetonitrile/5 mmol sodium heptanesulfonic acid in 50 
mmol potassium dihydrogen phosphate (pH 4). The flow rate was 0.9 ml/min. Retention 
times for SN-38G, SN-38 and camptothecin were 13.3, 18.4 and 19.3 min, respectively. 
SN-38 glucuronidation rates were reported as the ratios between SN-38 glucuronide (SN- 
38G) and internal standard (IS) peak heights. The intra-assay variability was determined 

20 by performing 10 incubations on the same day using a pool of human liver microsomes. 
The inter-assay variability was evaluated by incubating a pool of human liver microsomes 
in triplicate on three different days. The inter- and intra- assay variabilities were within 
7%. 

Statistical analysis 

25 The significance of linkage disequilibrium between pairs of polymorphic sites was 

assessed using genotypic data and a likelihood ratio test provided in ARLEQUIN, version 
2 (Schneider et al, 2000). ARLEQUIN was also used to run a modified Markov-chain 
random walk algorithm to test for Hardy-Weinberg equilibrium. Next, multisite 
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haplotypes were estimated using the program PHASE (Stephens et al, 2001). Because 
this program does not accept both bi- allelic and multi-allelic polymorphic sites, 
haplotypes were estimated only for individuals with either the (TA) 6 or (TA) 7 alleles. 



Thirteen individuals were heterozygous for the (TA) 5 or (TA)g repeat, three of 
5 which were heterozygous only at the TA repeat and therefore unambiguous at the other 
sites. For the remaining 10 individuals, haplotypes were determined manually by 
assuming that the chromosome with the (TA) 6 or (TA) 7 allele contained a haplotype 
previously identified by the PHASE analysis. In one case, this method would have 
resulted in a new (TA)g haplotype. However, it is more likely that this individual would 
10 instead have a novel (TA) 6 haplotype (V), which is consistent with the observation that 
the (TA)6 allele is found on multiple haplotypes, including other rare ones. An incorrect 
assignment would have little or no affect on the subsequent analyses because the novel 
haplotype only occurs once out of 103 individuals and not in a sample used in studies of 
correlation with phenotype. 

1 5 The effective number of haplotypes was calculated as the reciprocal of the sum of 

the frequency squared. Diversity in (TA) 6 haplotypes in Caucasians and African- 
Americans, based on the numbers and frequencies of haplotypes and adjusted by sample 
size, was estimated by DnaSP version 3.53 (Rozas et al) as well as their SDs. Statistical 
significance was assessed using a t-test as previously described (Nei, 1987). The chi- 

20 square test was used to analyze the differences in genotype/haplotype frequencies 
between Caucasians and African-Americans. 

UGT1A1 activity was phenotyped by measuring SN-38 glucuronidation rates of 
each liver as the mean ± SD of a single experiment performed in triplicate. The statistical 
analysis of the relationship between the (TA) n polymorphism and phenotype was planned 
25 to assess first the genotype effect on phenotype in the population sample (n = 83) using 
the analysis of variance (ANOVA). If the genotype effect was statistically significant 
then, within each ethnic group, a test of trend across the genotypes was performed using 
the exact Jonkheerer-Terpstra (JT) test (Gibbons et al, 1992). Pairwise comparisons 
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between two genotypes were performed using an exact one-sided Wilcoxon test. 
Moreover, trend analysis and pairwise comparisons were performed in genotypes 
expressed as the sum of TA repeats in both chromosomes (i.e. in samples with < 12 (5/6, 
6/6, 5/7), 13 (6/7) and < 14 (7/7, 6/8, 7/8) TA repeat genotypes). Concerning the 
haplotype-phenotype relationship, two-sided exact Wilcoxon tests were used to compare 
the SN-38 glucuronidation rates between two haplotypes. SAS system (SAS Institute, 
Inc., Cary, North Carolina) and StatXact-5 (CYTEL Software Corporation, Cambridge, 
Massachusetts, USA) were used for statistical analysis. GraphPad software version 3.02 
(GraphPad Software Inc., San Diego, California, USA) was used for graphical analysis. 

EXAMPLE 2 
GENOTYPING OF (TA) N POLYMORPHISM 

The (TA)6 allele was the most common allele with a frequency of 0.58 while the 
(TA) 7 allele had a frequency of 0.36 (Table 1). (TA) 5 and (TA) 8 alleles were also found, 
although at lower frequencies (0.02 and 0.05, respectively). In the population sample (n = 
107), the most common genotype was 6/7 (0.41), followed by the 6/6 genotype (0.34). 
Rare genotypes (<0.02) included 5/6, 5/7 and 5/8 genotypes. The (TA) 6 and (TA) 7 allele 
frequencies were not significantly different between Caucasians and African-Americans 
(chi-square test, P= 0.7). Similarly, 6/6, 6/7, and 7/7 genotype frequencies were not 
different between the two ethnic groups (chi-square test, P = 0.8). One Asian individual 
had a 6/6 genotype, while two individuals with other ethnicities had 6/7 and 7/7 
genotypes. 



Table 1: (TA), 



(TA) n 



Population 

Sample 

(n=107) 



Caucasians 
(n=56) 



polymorphism: genotype frequencies 



0.01 0.02 



African- 
Americans 
(nz39) 
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EXAMPLE 3 
SEQUENCING OF PBREM 

In 103 samples, six polymorphisms were found, and two of them (-3279G>T and 
-3156G>A) are common, with frequencies of 0.39 and 0.30, respectively (FIG. 1, Table 
5 2). All six polymorphisms are in Hardy-Weinberg equilibrium (P > 0.5). Based upon 
comparisons to the baboon sequence (accession number AC091778, which is 
incorporated herein by reference), it is likely that -3279G and -31566 are the ancestral 
states. The most common -3279G>T polymorphism is located in the spacer sequence of 
the NR3 domain of PBREM (FIG 1). No variants were found in the gtNRl domain, the 
10 binding site for constitutive active receptor (CAR). -3279G was significantly more 
common among African- Americans compared to Caucasians (chi squared = 13.82, P = 
0.001) while the frequency of -3156A did not significantly differ between the two ethnic 
groups (chi-square test, P = 0.9). 

1 5 Table 2 Sequencing of PBREM: genotype frequencies 



Position 


-3440 


-3401 


-3279 


-3177 


-3175 


-3156 


Genotype 


CC 


CA 


TT 


TC 


GG 


GT 


TT 


CC 


CG 


AA 


AG 


GG 


GA 


AA 


Population 

Sample 

(n=103) 


0.96 


0.04 


0.99 


0.01 


0.38 


0.46 


0.16 


0.99 


0.01 


0.99 


0.01 


0.49 


0.43 


0.08 


Caucasian 
(n=55) 


0.96 


0.04 


1 


0 


0.18 


0.58 


0.24 


1 


0 


1 


0 


0.47 


0.44 


0.09 


African- 
Americans 
(n=37) 


0.95 


0.05 


0.97 


0.03 


0.73 


0.24 


0.03 


0.97 


0.03 


0.97 


0.03 


0.51 


0.41 


0.06 



EXAMPLE 4 

LINKAGE DISEQUILIBRIUM AND HAPLOTYPE STRUCTURE OF THE 
20 UGT1A1 PROMOTER 

A likelihood ratio test detected significant pairwise linkage disequilibrium 
between sites -3279, -3156 and the (TA)„ polymorphism in our population sample (n = 
103, P < 0.0001). When only the common (TA) 6 and (TA) 7 alleles were used for the 
linkage disequilibrium analysis, the same results were obtained (P < 0.0001). When 
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pairwise linkage disequilibrium was separately assessed in Caucasians and African- 
Americans, highly significant linkage disequilibrium was similarly detected in Caucasians 
(P < 0.0001). In African- Americans, pairwise linkage disequilibrium was also detected 
between all sites, however, the level of significance varied greatly between the pairwise 
5 comparisons. Only linkage disequilibrium between (TA) n and -3156 had significance 
levels similar to those seen for Caucasians (P < 0.0005) while linkage disequilibrium had 
only low levels of significance between (TA)„ and -3279 (P = 0.02) and between -3279 
and -3156 (P = 0.04). 

Multisite haplotype inference resulted in 10 haplotypes spanning the PBREM 
10 variants and the (TA) n polymorphism (Table 3). Haplotypes I-V include the (TA) 6 allele, 
and haplotype I differs from haplotype II at position -3279 in the NR3 domain of 
PBREM. Haplotypes VI, VII and VIII include the (TA) 7 repeat and haplotypes VI and VII 
differ from each other at position -3156. There is a suggestion that the haplotype 
structure of the (TA)6 allele is different in the African-American subsample. Compared 
15 to Caucasians, haplotype I is less common in African- Americans (chi squared = 27.06, P 
< 0.0001), while haplotype II is more common (chi squared = 14.84, P = 0.0001). 
Differences in haplotype VI and VII frequencies were not statistically significant between 
the two groups (chi- square test, P = 0.44 and 0.48, respectively). 

Among the samples examined, 21 different combinations of these haplotypes were 
20 found. In Caucasians, the most frequent haplotype pairs are I/VI (0.35), I/I (0.24) and I/II 
(0.11), while in African-Americans, they are I/II (0.11), II/VI (0.11), II/VIII (0.08), I/VI 
(0.08), II/VII (0.08) and VI/VI (0.08). The effective numbers of haplotypes, which reflect 
how many relatively high frequency haplotypes are observed, were 5.2 and 2.6 in African- 
Americans and Caucasians, respectively (Table 3). Finally, diversity (± SD) of (TA) 6 
25 haplotypes was 0.555 ± 0.070 and 0.262 ± 0.065 in African-Americans and Caucasians, 
respectively (P < 0.05). 
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African- 
Americans (n=37) 
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EXAMPLE 5 

UGT1A1 PHENOTYPING AND ASSOCIATION WITH (TA) N POLYMORPHISM 
AND HAPLOTYPES 

UGT1A1 activity was measured as SN-38 glucuronidation rates in 83 human liver 
5 microsomes. A 46% coefficient of variation (1.90 f 0.87 SN-38G/IS, mean f SD) and a 
10-fold range in SN-38 glucuronidation were observed. 

Because of the small number of subjects in the 5/7, 5/6, 6/8 and 7/8 genotypes, 
only 6/6, 6/7 and 7/7 were used in the ANOVA analysis. The phenotype was 
significantly different across these three genotypes (P =0.008) (FIG. 2a). The degree of 

10 variation of the SN-38 glucuronidation rate across the genotypes was similar in different 
ethnic groups (P > 0.1). A significantly decreasing trend was shown across the 6/6, 6/7 
and 7/7 genotypes in Caucasians (P < 0.001, JT test, FIG. 2b) and across the 6/6, 6/7, 6/8 
and 7/7 genotypes in African-Americans (P = 0.033, JT test) (FIG. 2c). When samples 
with Asian (n = 1), other (n = 2) and unknown (n = 10) ethnic background were pooled 

15 together, no significant trend could be found across (TA) n genotypes (P > 0.1, JT test) 
(FIG. 2d). In the Caucasian sample, pairwise comparisons of the phenotype between two 
genotype groups showed significant differences between the 6/7 and 7/7 (P= 0.007, one- 
sided exact Wilcoxon test) and 6/6 and 7/7 groups (P = 0.0002). No pair-wise 
comparison was significant within African-Americans, probably due to small number of 

20 samples of each genotype. 

When (TA) n genotypes were regarded as the sum of TA repeat number in both 
chromosomes (i.e. < 12 (5/6, 6/6, 5/7), 13 (6/7) and > 14 (7/7, 6/8, 7/8) genotypes), a 
significant trend of reduced UGT1A1 activity (P<0.01) was measured across the three 
groups (the lowest being the > 14 genotype group) in the whole sample population, in 
25 Caucasians, in African-Americans but not in samples with Asian/other/unknown ethnicity 
(P = 0.66). Pairwise comparisons (one-sided exact Wilcoxon test) showed significantly 
reduced UGT1A1 activity (P<0.01) in > 14 compared to 13 and < 12 genotypes, and in 13 
compared to < 12 genotypes in the whole sample population and in Caucasians. In 
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African- Americans < 12 genotypes had significantly higher UGT1A1 activity compared to 
either 13 or > 14 genotypes (P = 0.028 and 0.016, respectively), but UGT1A1 activity was 
not significantly different between 13 and > 14 genotypes (P = 0.1 1). 

In samples of Caucasian and African origin, SN-38 glucuronidation rate varies 
5 significantly across the haplotypes with a decreasing trend (P < 0.0001, JT test) (FIG. 3). 
However, this apparent haplotype-phenotype correlation is likely to be due to the effect of 
the (TA) n polymorphism that is in linkage disequilibrium with the PBREM variants. 
Hence, the possible functional effects of the common -3279G>T and -3156G>A variants 
were investigated by comparing the SN-38 glucuronidation rates across genotypes that 

10 differed only by the variant being examined. Concerning the -3279G>T variant, SN-38 
glucuronidation was reduced in I/II pairs compared to I/I pairs among Caucasians, 
although without reaching statistical significance (2.06 ± 0.74 versus 2.53 ± 0.82 SN- 
38G/IS, respectively) (Wilcoxon rank sum test, P = 0.18). Concerning the -3156G>A 
variant, although SN-38 glucuronidation is slightly reduced in I/VII compared to I/VI 

15 pairs, the difference is not statistically significant (Wilcoxon rank sum test, P = 0.64). 

EXAMPLE 6: 

MATERIALS AND METHODS FOR EXAMPLE 7 
Patient Selection 

Patients with histologically confirmed solid tumors or lymphoma known to 
20 respond to irinotecan or for which no therapy of proven benefit exists were eligible to 
participate in this study. Other eligibility criteria included measurable disease by 
radiologic imaging or physical examination; age of at least 18 years; Karnofsky 
performance status of at least 70% (ambulatory and capable of self-care); and adequate 
organ function defined as absolute neutrophil count (ANC) > 1500 ul" 1 , platelet count > 
25 100,000 ul" 1 , serum creatinine level < 1.5 mg/dl or creatinine clearance > 60 ml/min, AST 
and ALT levels < 5 times the upper limit of normal, and conjugated bilirubin within 
normal limits. Patients must have been off previous anticancer therapy, including 
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radiation therapy, for at least 4 weeks (6 weeks if the previous treatment included a 
nitrosourea or mitomycin C) and off colony stimulating factor for at least 2 weeks. 
Patients with a history of inflammatory bowel disease requiring therapy, chronic diarrheal 
syndrome, paralytic ileus, or organ or stem cell transplant were excluded from the study. 
5 Concurrent use of medications that may be substrates of the UGT1 Al enzyme or that may 
be inducers or inhibitors of UGT1A1 activity was not permitted. Pregnant and lactating 
women were also excluded from participation, and those with reproductive potential were 
required to use an effective contraceptive method if sexually active. 

Treatment Protocol 

10 Irinotecan was supplied by the National Cancer Institute (NCI) as an intravenous 

solution with concentration 20 mg/ml in either 2 ml or 5 ml vials. The amount of 
irinotecan to be administered was removed aseptically from the vial and added to 500 ml 
of 0.9% saline or 5% dextrose injection, USP. Thirty minutes after pretreatment with 20 
mg intravenous ondansetron, irinotecan 350 mg/m 2 was administered as a 90 minute 

15 intravenous infusion once every 3 weeks — a standard dose and schedule. History, 
physical examination, complete blood count (CBC) with differential, serum chemistry 
profile (electrolytes, blood urea nitrogen, creatinine, glucose, albumin, alkaline 
phosphatase, GGTP, AST, ALT, total and conjugated bilirubin, uric acid, and lactate 
dehydrogenase), and coagulation profile (prothrombin time and partial thromboplastin 

20 time) were conducted prior to first treatment. Thereafter, history, physical examination, 
and toxicity assessment were conducted on day 1 of each cycle unless treatment-related 
toxicity required more frequent follow up. CBC and serum chemistry profile were 
obtained weekly throughout treatment, though CBCs were obtained 3 times per week 
with the appearance of grade 3 or 4 neutropenia or thrombocytopenia. Toxicity 

25 assessment was done according to the NCI common toxicity criteria, version 2.0 
(website: ctep.cancer.gov). Objective tumor assessment by appropriate radiographic 
imaging was performed prior to starting therapy and after every 2 cycles. 

Toxicity Management and Dose Modification 
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For patients who experienced diarrhea, abdominal pain, or diaphoresis within 24 
hours of irinotecan administration, 0.25 mg to 1 mg of intravenous atropine was 
considered. Delayed diarrhea, defined as diarrhea occurring more than 24 hours after 
irinotecan administration, was treated promptly with loperamide 4 mg at the onset and 
5 then with 2 mg every 2 hours until the patient was diarrhea-free for at least 12 hours. For 
patients who failed loperamide therapy, diphenoxylate, octreotide, and tincture of opium 
were sequentially added as needed. Patients were instructed to aggressively hydrate 
orally and were admitted to the hospital for intravenous electrolyte and fluid replacement 
when necessary. A new course of therapy was not started until the ANC recovered to at 
10 least 1500 uT 1 , the platelet count recovered to at least 100,000 ul" 1 , and treatment-related 
diarrhea fully resolved. Patients with grade 3 or 4 toxicities of any kind were dose- 
reduced by 50 mg/m 2 for subsequent cycles. 

Sample Collection 

Prior to the first irinotecan infusion, venous blood (4.5 ml) for genotyping was 
15 collected in purple top Vacutainer® tubes containing EDTA (Becton, Dickinson, and 
Company, Franklin Lakes, NJ) and stored at -80°C for no more than 5 days prior to 
analysis. Venous blood for pharmacokinetic analysis was collected on day 1 of cycle 1 
for pharmacokinetic analysis. Samples of 7 ml were collected into green top sodium 
heparinized Vacutainer® tubes prior to the infusion; 30, 60, and 90 minutes during the 
20 infusion; and 10, 20, 30, 45, and 60 minutes and 1.5, 2, 4, 6, 12, and 24 hours after the 
infusion. Samples were centrifuged (2500 rpm, 20 min, 4°C) and the plasma was 
immediately separated, transferred as two aliquots into storage tubes, frozen at -80°C 
until analysis. 

UGT1A1 Genotyping Assays 

25 The variants typed in this study are listed in Table 4. The UGT1A1 (TA)nTAA 

polymorphism was genotyped by PCR and product sizing as previously described (Te et 
al, 2000). Alleles with 6 TA repeats resulted in a 98 bp fragment while alleles with 7 TA 
repeats resulted in a 100 bp fragment. Alleles with 5 TA and 8 TA repeats resulted in 96 
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bp and 102 bp fragments respectively. Alleles with 5, 6, 7, and 8 TA repeats are reported 
as (TA)n and genotypes are assigned based upon the number of TA repeats in each allele, 
i.e., 616, 611, 111, 6/8, et cetera. 

The variants in the 5 'upstream region (-3279G>T and -3156G>A) and in exon 1 
5 [21 1G>A (G71R) and 686OA (P229Q)] were genotyped by single base extension (SBE) 
and separated on a denaturing high performance liquid chromatography (DHPLC) system 
(Devaney et al, 2001). Genotyping of the -3279G>T and -3156G>A variants was 
performed by PCR amplification of a 333 bp fragment in the UGT1A1 5' upstream 
region that contains both variants. The PCR primers used were: 5'-ACC TCT AGT TAC 

10 ATA ACC TGA A-3' (forward primer; SEQ ID NO:6) and 5'-AAT AAA CCC GAC 
CTC ACC AC-3' (reverse primer; SEQ ID NO:7). PCRs were performed in a 15 ul 
volume containing 125 nM each primer, 2.5 mM MgCl 2 , 50 uM each dNTP and 0.375 U 
of AmpliTaq Gold polymerase (Applied Biosystems) in the buffer provided by the 
manufacturer. PCR cycling conditions were for 40 cycles at 95°C for 15 s, 58°C for 15 s 

15 and 72°C for 30 s in a 9600 thermal cycler (Applied Biosystems). PCR amplified 
products were purified using shrimp alkaline phosphatase and exonuclease I by 
incubating at 37°C for 45 min prior to the SBE reaction. SBE reactions were performed 
in duplex for genotyping of both variants in 10 jul volumes containing 1 uM of extension 
primer (5'-GCC AAG GGT AGA GTT CAG T-3' (SEQ ID NO: 8) for -3279G>T and 5'- 

20 GAC CCC AGC CCA CCT GTC-3' (SEQ ID NO:9) for -3156G>A), 250 uM each 
ddNTP and 1.25 U thermosequenase (Amersham Pharmacia Biotech). Reactions were 
cycled at 96°C for 30 s, 55°C for 30 s and 60°C for 30 s for 60 cycles. Separation of the 
SBE products was performed on a WAVE 3500HT DHPLC system (Transgenomic Inc) 
at 70°C after denaturation of the samples. The flow rate used was 1.5 ml/min and the run 

25 time for each sample was 2.5 min. The gradient used for elution of the SBE products was 
created by the software based on the length of the extended product and was adjusted 
from 24% to 34% buffer B over 2 min (buffer B contains 25% acetonitrile). Extended 
products were eluted in the order of C<G<T<A which is dependent on the hydrophobicity 
differences of the four bases. 
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Genotyping of the 211G>A and 686OA exon 1 variants was performed by PCR 
amplification of a 774 bp fragment that encompasses both variants. The PCR primers 
used were: 5'-ATG CTG GGA AGA TAC TGT TG-3' (forward primer; SEQ ID NO: 10) 
and 5'-TTT GGT GAA GGC AGT TGA TT-3' (reverse primer; SEQ ID NO: 11). PCRs 

5 were performed in a 15 ul volume containing 125 nM each primer, 2.5 mM MgCl 2 , 100 
uM each dNTP and 0.375 U of AmpliTaq Gold polymerase (Applied Biosystems) in the 
buffer provided by the manufacturer. PCR cycling conditions were for 40 cycles at 95°C 
for 15 s, 55°C for 15 s and 72°C for 45 s in a 9600 thermal cycler (Applied Biosystems). 
PCR purification was performed as described above and the SBE reactions were 

10 performed in lOul volumes containing 1 uM of each extension primer (5'-GTC TTC 
A AG GTG TAA AAT GCT C-3' (SEQ ID NO: 12) for 211G>A or the 5'-GTG CGA 
CGT GGT TTA TTC CC-3' (SEQ ID NO: 13) for 686OA) using the conditions 
described above. For separation on the DHPLC system, a flow rate of 1.5 ml/min and a 
run time of 3 min was used for each sample. The gradient used for elution of the SBE 

1 5 products was created by the WAVE software based on the length of the extended product 
and was adjusted from 25.6% to 38.1% buffer B over 2.5 min. 

Pharmacokinetic Analysis 

Plasma concentrations of irinotecan and its metabolites were determined as 
previously published (Iyer et ah, 2001). Pharmacokinetic parameters for irinotecan, SN- 
20 38, and SN-38G were calculated using standard non-compartmental methods with 
WinNonlin 2.0 (Pharsight Corporation, Mountain View, CA). The area under the plasma 
concentration-time curve (AUC) from time zero to the last measured concentration of 
irinotecan and metabolites was determined by the linear trapezoidal method. The 
glucuronidation ratio was expressed as the ratio of the SN-38G AUC over SN-38 AUC. 

25 Statistical Analysis 

The study was originally designed to prospectively investigate the relationship 
between genetic variation in the UGT1 Al promoter and grade 3-4 diarrhea. Results from 
clinical trials using the 350 mg/m2 every 3 weeks schedule suggested a 20 to 35% 
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frequency of diarrhea (ref). Based on previously published data, a single-gene Mendelian 
model implied that 16% of patients would have the 7/7 genotype, 48% would have the 
6/7 genotype, and 36% would have the 6/6 genotype. A sample size of 60 would have 
had power of 0.8 at a=0.05 to detect a linear trend in the proportion of patients within 
5 each genotype experiencing grade 3-4 diarrhea defined by 60% of 7/7 patients, 30% of 
6/7 patients, and 10% of 6/6 patients. 

However, due to lower than expected frequency of grade 3-4 diarrhea (see below), 
the analyses were instead focused on the frequency of grade 4 neutropenia (ANC < 500 
ul" 1 ). Nonparametric trend tests were used to investigate how the genotype is related to 

10 pharmacokinetic parameters, pretreatment bilirubin levels and ANC nadir. The 
relationship between genotype and grade 4 neutropenia was assessed by the use of 
Fisher's exact test and calculation of the relative risks. Univariate regression analyses 
were performed to identify the potential predictors of ANC nadir. They were performed 
on the log scale for ANC to reduce skewness in the residuals. The pretreatment variables 

15 were also considered jointly via analysis of covariance (ANCOVA) models in order to 
identify the pretreatment measurements that can predict ln(ANC nadir). A different 
ANCOVA model simultaneously considering the pre- and post- treatment variables was 
used to explore the mechanism through which variability in UGT1A1 status might affect 
the ANC nadir. 

20 EXAMPLE 7: 

Role of-3516G>A of UGT1A1 in Irinotecan Toxicity 
Patient Characteristics 

Sixty-six patients were enrolled in the study (Table 5). Blood was mistakenly not 
drawn for DNA extraction in one patient and genotype information is available in 65 
25 patients. Sixty-three patients were assessable for toxicity as 3 patients (one 6/6, one 6/7, 
one 7/8) missed scheduled blood tests and/or physician appointments. Sixty patients are 
assessable for tumor response, as 6 of them were removed from the study before 
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radiological assessment of tumor response. All the patients received prior chemotherapy 
regimens. Thirty-five of them received additional prior radiotherapy. 

Allele and Genotype Frequencies 

5 The TA indel allele frequencies were: TA 6 =0.68, TA 7 =0.29, TA 8 =0.02, 

TA 5 =0.01. The TA 5 and TA 8 alleles occurred exclusively in Black patients (one with 5/6, 
two patients with 6/8, and one patient with 7/8 genotype). -3279T and -3156A alleles 
had a frequency of 0.55 and 0.26, respectively. 

Table 6 shows the frequencies of promoter haplotypes comprising -3279, -3156, 
10 and the TA indel, based upon our previous publication on their linkage disequilibrium 
(Innocenti et al., 2002). The frequency of the haplotype pairs is shown in Table 7. No 
exon 1 variants (21 1G>A and 686OA) were detected in this patient population. 

Toxicity Prevalence, Relative Risk, Genetic Test. 

15 Toxicity of diarrhea and neutropenia refer to events observed during cycle 1 of 

treatment. The frequency of grade 4 neutropenia was 9.5%. Grade 4 neutropenia was 
much more common in patients with genotype 7/7 (3/6, 50%) compared to patients with 
6/7 genotype (3/24, 12.5%) and 6/6 genotype (0/30, 0%) (p=0.001, Fisher's exact test). 
Nonparametric trend analysis revealed that the TA indel polymorphism is significantly 

20 correlated to ln(ANC nadir) (7/7<6/7<6/6, z=-2.35, p = 0.02) (FIG. 4). 

Because the -3156G>A variant distinguishes between two different haplotypes in 
the TA 7 individuals, the relative risk of grade 4 neutropenia was analyzed for the -3156 
AA genotype (versus AG and GG combined) and 7/7 genotype (versus 5/6, 6/6, 6/7 and 
6/8 combined). A higher relative risk was found in patients with -3156 AA genotype 
25 (14.0, 95% CI 2.1-36.7) compared to patients with 7/7 genotype (9.3, 95% CI 1.7-40.7, 
n=63). Moreover, the predictive power of a genetic test in patients receiving irinotecan 
was evaluated for both the TA indel and the -3156 variant (Table 8). The predictive 
power of either 7/7 or -3156 AA genotypes for grade 4 neutropenia was evaluated. In 
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addition, the predictive power of either 6/6 or -3156 GG genotypes was evaluated in 
relation to the absence of grade 4 neutropenia (i.e., grade 0-3). In this comparison, the 
two 6/8 patients were regarded as either 6/6 or 6/7 genotypes in order to assess whether 
patients with the TAg allele might be a confounding factor for the results of the genetic 
5 test. 

While this study was originally conceived to examine the relationship between 
UGT1A1 genotype and severity of diarrhea, the frequency of grade 3 diarrhea in our 
patients was only 5% (n=3), with no instances of grade 4 diarrhea. None of the three 
patients with grade 3 diarrhea were 6/6 (2 6/7 and one 7/7 genotypes). Concerning the 
10 diarrhea events in patients with the TAg allele (two 6/8 and one 7/8), only a grade 1 event 
was reported in one 6/8 patient. The low frequency of severe diarrhea did not allow any 
formal statistical analysis. 

Total Bilirubin: Correlation with TA Indel Genotype and Toxicity 

15 Pretreatment total bilirubin levels were obtained in all patients (0.5+0.22 mg/dl, 

mean+SD, n=66). As is shown in FIG. 5, total bilirubin levels were significantly 
correlated with the TA indel polymorphism (nonparametric trend analysis, 7/7>6/7>6/6, z 
= 2.88, p < 0.01). Total bilirubin levels were significantly higher in 7/7 patients 
compared to 6/6 and 6/7 patients combined (0.80+0.29 and 0.48+0.19 mg/dl, respectively, 

20 p = 0.0003). Concerning the distribution of the -3156 genotypes withing each TA indel 
genotype group, in the 6/7 genotype group, the three patients with GG genotype had low 
bilirubin levels of 0.3-0.4 mg/dl. Similarly, the two patients with 6/8 and GG genotypes 
had low levels of bilirubin of 0.2-0.3 mg/dl. The one patient with GA genotype in the 7/7 
group has a bilirubin level of 0.6 mg/dl, which is in the low range for this genotype group. 

25 The 7/8 patient did not have markedly elevated levels of total bilirubin as would be 
expected if the TAg allele resulted in decreased glucuronidation. 

In addition, the -3156 and the TA indel variants were correlated with total 
bilirubin by multiple regression analysis. The AA genotype showed a slightly better 
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correlation (r 2 =0.28, p<0.0001) compared to 7/7 genotype, either when the TA& alleles 
were regarded as TA 6 (r 2 =0.23, p=0.002) or TA 7 (r 2 =0.20, p=0.0009). The other common 
variant -3279G>T had no significant association with total bilirubin. 

Whether pre-treatment bilirubin would correlate with neutropenia was also 
5 analyzed. Significantly higher bilirubin levels were observed in patients with grade 4 
neutropenia (0.83+0.21 mg/dl) compared to those without grade 4 neutropenia (0.47±0.20 
mg/dl) (p=0.0001) (FIG. 6). No cases of grade 4 neutropenia were reported in patients 
with bilirubin levels less than 0.6 mg/dl. Out of the 7 patients with total bilirubin higher 
then 0.7 mg/dl, 4 of them had grade 4 neutropenia. 

1 0 Correlation Between TA Indel Genotype and PK Parameters 

Table 9 describes the pharmacokinetic parameters of irinotecan and its 
metabolites stratified by 6/6, 6/7, and 7/7 genotypes. SN-38 AUC increases while 
increasing the number of TA7 alleles (nonparametric trend analysis, 7/7>6/7>6/6, z = 
2.13, p = 0.03). Conversely, glucuronidation ratios (SN-38G/SN-38 AUC ratios) were 
15 reduced while increasing the number of TA 7 alleles, (nonparametric trend analysis, 
6/6>6/7>7/7, z=-2.16, p = 0.03). No significant trend was found for irinotecan and SN- 
38G AUCs (p>0.05). 

Regression Analysis 

20 The impact of both pharmacokinetic variability and pre-treatment (including 

genotype) variables on variability in neutropenia was also examined. Instead of the TA 
indel genotype, the -3156 variant was used because 1) the -3156 genotype was better 
correlated with the risk of grade 4 neutropenia and 2) -3156 better reflected the UGT1A1 
status of patients, based upon the data on the correlation with total bilirubin. Univariate 

25 regression analyses of ANC nadir selected SN-38 AUC, total bilirubin and -3156 
genotype as the three best independent variables (Table 10). Gender showed a non- 
significant correlation with ANC nadir but it was included in further modeling because of 
possible gender differences in glucuronidation. Other variables did not show any 
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correlation. 



Multivariate Analyses 

Several multivariate predictive ANCOVA models were considered to identify the 
5 pretreatment measurements that predict ln(ANC nadir). The final model (r 2 =0.41) was 
selected by backward elimination from Table 10 and is presented Table 11. Pretreatment 
bilirubin level is found to be very significant and negatively related to ln(ANC nadir). 
Gender and -3156 genotype are found to be marginally significant after adjusting for the 
total bilirubin level. Ln(ANC nadir) is found to have a lower value in women, and it 
10 decreases with increasing number of (TA) 7 alleles (6/6>6/7>7/7). Other factors, such as 
ethnicity, number of prior regimens, performance status, and ln(pretreatment ANC) are 
not found to be significant predictors of ln(ANC nadir) after adjusting for -3156 
genotype, gender and total bilirubin. 

After determining the predictive model using pre-treatment variables, the post- 
15 treatment measurements of irinotecan AUC, SN-38 AUC, SN-38G AUC, and 
glucuronidation ratio were added to the model as independent variables with the intention 
of determining the possible mechanism of how the variability in UGT1A1 status affects 
ln(ANC nadir). The final model selected through backward elimination (r 2 =0.5141) 
which best predicts ln(ANC nadir) includes genotype and SN-38 AUC (p<0.001) (Table 
20 12). 

Toxic Death and Response 

One toxic death was reported, as the patient died of neutropenia-related sepsis. 
He was admitted to the hospital on day 7 of cycle 1 with fever and an no neutrophils 
detected (white blood cell count of 100 ul" 1 ). He was empirically treated with 
25 ceftazadime, tobramycin, and fluconazole, though no infectious source was ever 
identified. Despite support with granulocyte colony stimulating factor, the patient 
remained neutropenic, became septic, and died on day 11. He had 7/7 genotype and the 
highest level of pretreatment total bilirubin observed in these patients (1.2 mg/dl). 
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Concerning the response rates in this trial, three objective responses were 
observed. Two patients achieved a partial response (one with colorectal and the other 
with head and neck cancer) and had a 6/7 genotype. One colorectal cancer patient 
achieved a complete response and had a 6/6 genotype. 

Table 4 

UGT1A1 variants typed in this study. Positions indicated are from the first base of the 



UGT1A1 start site in the UGT1A cluster reference sequence (AF297093). 



Nucleotide 
change 


Amino acid 
change 


Exon 


-3156G>A 




Promoter 


-3279G>T 




Promoter 


TA indel 




Promoter 


211G>A 


G71R 


1 


686OA 


P229Q 


1 



Table 5 
Patient characteristics 



No. of patients 


Patients 




Entered 


66 


Assessable for toxicity 


63 


Assessable for response 


60 


Sex 




Male 


39 


Female 


27 


Age, median (range) 


60 (34-85) 


Ethnicity 




White 


50 


Black 


10 


Hispanic 


4 


Pacific Islander 


1 


Asian 


1 


Performance Status 




100% 


18 


90% 


31 


80% 


10 


70% 


17 


Tumor type 
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Colorectal 

Gastroesophageal 

Head and Neck 

Liver 

Lung 

Pancreas 

Unknown Primary 
Others 

Prior Radiotherapy 
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Table 6 



Frequency of UGT1A1 promoter haplotypes. 



-3279G>T 


-3156G>A 


TA indel 


Frequency 


T 


G 


6 


0.55 


G 


G 


6 


0.13 


G 


A 


7 


0.25 


G 


G 


7 


0.03 


G 


G 


8 


0.02 


G 


G 


5 


0.01 



Table 7 
Frequency of haplotype pairs 



Haplotype pairs 


Frequency 


TG6/TG6 


0.28 


TG6/GA7 


0.28 


TG6/GG6 


0.18 


GA7/GA7 


0.08 


GG6/TA7 


0.06 


TG6/GG7 


0.05 


TG6/GG8 


0.02 


GG6/GG8 


0.02 


GG5/TG6 


0.02 


GA7/GG8 


0.02 



The haplotypes reflect the change of -3279, -3156, and the TA indel variants, such as the 
first base refers to -3279 variants, the second to -3 1 56 variant and the number refers to 
the number of TA repeats. 

Table 8 



Genetic tests for the TA indel and -3156 genotypes 





Sensitivity 


Specificity 


PPV 


NPV 


7/7, grade 4 


0.50 (0.19-0.81) 


0.95 (0.85-0.98) 


0.50(0.19-0.81) 


0.95 (0.85-0.98) 


-3156 AA, grade 4 


0.50(0.19-0.81) 


0.96 (0.88-0.99) 


0.60 (0.23-0.92) 


0.95 (0.86-0.98) 


6/6, grade 0-3, 6/8=6/6 


0.57 (0.44-0.69) 


1.00 (0.61-1.00) 


1.00 (0.89-1.00) 


0.20 (0.10-0.37) 


6/6, grade 0-3, 6/8=6/7 


0.54 (0.41-0.66) 


1.00 (0.61-1.00) 


1.00 (0.89-1.00) 


0.19 (0.09-0.35) 


-3156 GG, grade 0-3 


0.63 (0.49-0.74) 


1.00 (0.61-1.00) 


1.00 (0.90-1.00) 


0.22 (0.11-0.41) 



PPV, positive predictive value. 
NPV, negative predicted value. 
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Data are shown with 95% CI in parenthesis. The patient with 5/6 genotype was regarded 
as having a 6/6 genotype. 

Table 9 

Pharmacokinetic parameters and by 6/6, 6/7, and 7/7 TA indel genotypes 



Irinotecan SN-38 a SN-38G Glucuronidation Ratio b 
TA indel No. of AUC AUC AUC (SN-38G AUC/SN-38 AUC) 
genotype patients (ng*h/ml) (ng*h/ml)(ng*h/ml) 



6/6 30 


24412.8 335.9 1954.2 


6.52 




(7691.6) (167.7) (1361.1) 


(3.98) 


6/7 25 


26085.5 458.4 1887.9 


5.55 




(10814.2) (379.8) (1682.5) 


(4.79) 


7/7 6 


25432.9 542.0 1819.1 


3.59 




(6694.9) (195.3) (1249.8) 


(2.81) 



a 6/6<6/7<7/7, z=2.13, p=0.03, non-parametric trend analysis. 
b 6/6>6/7>7/7, z=-2.16, p=0.03, non-parametric trend analysis. 



Data expressed as mean (standard deviation). 



Table 10 

Univariate analysis of ln(ANC nadir) 



Independent Variable 



SN-38 AUC °' 3523 <0 - 0001 

Pre-treatment total bilirubin 0.2979 <0.0001 

-3156 genotype 0.2413 0.0003 



Irinotecan AUC 



SN-38G AUC 



0.1273 0.0041 



Glucuronidation ratio 0.1171 0.0060 

Gender 0.0445 0.0971 



0.0411 0.1109 



Age > 70 0.0242 0.2231 

White ethnicity 0.0128 0.3764 

Ln(pre-treatment ANC) 0.0000 0.9749 

Performance status 0.0016 0.9923 
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Table 11 

ANCOVA for the final predictive model of In(ANC nadir) using pre-treatment 
variables 

5 





Coefficient 


SE 


p-value 


Intercept 


8.1885 


0.2767 


<0.001 


Genotype 








AA vs. GG+GA 


-0.9401 


0.3986 


0.022 


Gender 








Males vs. Females 


0.4323 


0.2001 


0.035 


Total Bilirubin 


-1.8452 


0.4816 


<0.001 



SE, standard error 

The overall model shows an r 2 value of 0.4048 (p<0.0001). 



Table 12 

ANCOVA for the final predictive model of ln(ANC nadir) using pre-treatment and 

1 0 post-treatment variables 

Coefficient SE p-value 

Intercept 8.3111 0.1517 <0.001 

Genotype 

AA vs. GG+GA -1.3798 0.3234 <0.001 

SN-38 AUC 

-0.0019 0.0003 <0.001 

SE, standard error 

The overall model shows an lvalue of 0.5128 (p<0.0001). 



15 All of the compositions and/or methods disclosed and claimed herein can be made 

and executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied 
to the compositions and/or methods and in the steps or in the sequence of steps of the 

20 method described herein without departing from the concept, spirit and scope of the 
invention. More specifically, it will be apparent that certain agents that are both 
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chemically and physiologically related may be substituted for the agents described herein 
while the same or similar results would be achieved. All such similar substitutes and 
modifications apparent to those skilled in the art are deemed to be within the spirit, scope 
and concept of the invention as defined by the appended claims. 
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